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Definition of some units 


Angstróm 1A = 107'°m (order of magnitude of 
the atomic dimensions) 
Fermi IF =10°%m (order of magnitude of 
the nuclear dimensions) 
Barn lb = 10778 m? = (107* AY = (10 FP 
Electron Volt l eV = 1.602 189(5) x 10719 joule 
Useful orders of magnitude 
Electron rest energy : mec? = 0.5 MeV [0.511 003(1) x 10% eV] 
Proton rest energy : Mpc? = 1 000 MeV [938.280(3) x 10% eV] 
Neutron rest energy : M,c? = 1 000 MeV [939.573(3) x 10% eV] 


One electron volt corresponds to: 


a frequency v œ 2.4 x 10'* Hz through the relation E = hv 
[2.417 970(7) x 101: Hz] 


a wavelength 2 = 12000A through the relation 4 = c/v 

[12 398.52(4) A] 
a wave number ; = 8 000 cm7’ [8 065.48 (2) cin” *] 
a temperature T ~ 12000 K through the relation E = kgT | 


[11 604.5(4) K] 


In a | gauss magnetic field (107 * Tesla) : 


the electron cyclotron frequency v, = w,/21 = — qB/2nm, 

is v = 2.8 MHz [2.799 225(8) x 10% Hz] 
the orbital Larmor frequency v, = wL/2n = — ugB/h = v.2 

is v, = 1.4 MHz [1.399 612(4) x 10° Hz] 


(this corresponds by definition to a g = 1 Landé factor) 
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Some general physical constants 


h = 6.626 18(4) x 1073* joule second 
Planck’s constant h 

h = - 1.054 589(6) x 107 ** joule second 
Speed of light (in vacuum) = 2.997 924 58(1) x 10% m/s 
Electron charge q = — 1.602 189(5) x 107!° coulomb 
Electron mass m, = 9.109 53(5) x 107?! kg 
Proton mass M, = 1.672 65(1) x 107?” kg 
Neutron mass M, = 1.674 95(1) x 107?” kg 

M 


—2 = 1 836.1515(7) 


A, = h/m,c = 2.426 309(4) x 107? A 


Electron Compton wavelength 
å. = fifm,c = 3.861 591(7) x 107% A 


Fine structure constant — qq _e_ 1 
(dimensionless) Ane fie he “he 137.0360(1) 
Bohr radius Ay -4 = 0.529 177 1(5)A 


Hydrogen atom ionization 
energy (without proton recoil — Ej, = a?m,c?/2 = 13.605 80(5) eV 
effect) 


Rydberg’s constant R, = — Ey, /he = 1.097 373 18(8) x 10% cm”! 

” Classical” electron radius r, = ana = 2.817 938(7) fermi 

Bohr magneton Up = qh/2m, = — 9.274 08(4) x 107 ?* joule/tesla 
Electron spin g factor Je = 2 x 1.001159 657(4) 

Nuclear magneton Hy = — qh/2M, = 5.05082(2) x 107?” joule/tesla 


Boltzmann's constant kp = 1.380.66(4) x 107?* joule/K 
Avogadro’s number N, = 6.022 05(3) x 10% 
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Useful Identities 


U : scalar field; A, B, ... : vector fields. 


Vx (VU) =0 Vv. (VU) = AU 
V.(Vx A)=0 Vx (Vx A)=V(V.A)-AA 
b=", x V 
ye aa xL 
rór hr? 
10? L? 
par hr? 


A x (B x C) = (A. C)B — (A. B)C 
A x (B x C) + B x (C x A) + C x (A x B) =0 
(A x B).(C x D) = (A. C)(B . D) — (A. D)(B . C) 


(A x B)x (C x D) = [(A x B). DJC — [(A x B). C]D 
[(C x D). A]B — [(C x D). BJA 


V (UV) =UW +V VU 

A (UV) = UAV + (VU). (WV) + V AU 

V. (UA) =UV.A+A.VU 

Vx(UA) =U VxA + (VU) x A 

V.(A x B) =B.(VxA)- A.(VxB) 

Y (A.B) =A x (Vx B)+Bx(Vx A)+B.V A+A.V B 


Vx (Ax B)=A (V.B)—B(V.A)+B.V A-A.V B 


N.B. : B.VA vector field whose components are : 
(B.V A) = BA =] B- 4, 
2 E pyr 5 J ôx; i 


(i = x, y, z) 
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A. INTRODUCTION 


A. INTRODUCTION 


1. Importance of collision phenomena 


Many experiments in physics, especially in high energy physics, consist of 
directing a beam of particles (1) (produced for example, by an accelerator) onto 
a target composed of particles (2), and studying the resulting collisions : the various 
particles* constituting the final state of the system — that is, the state after the 
collision (cf. fig. 1) — are detected and their characteristics (direction of emission, 
energy, etc.) are measured. Obviously, the aim of such a study is to determine the 
interactions that occur between the various particles entering into the collision. 


Detector 


5 
P 


: et 
Incident beam ZM 
>> > —_ fe pees ee 


particles (1) 


particles (2) 


Detector 


FIGURE I 


Diagram of a collision experiment involving the particles (1) of an incident beam and the particles (2) 
of a target. The two detectors represented in the figure measure the number of particles 
scattered through angles 0, and 6, with respect to the incident beam. 


The phenomena observed are sometimes very complex. For example, if 
particles (1) and (2) are in fact composed of more elementary components (protons 
and neutrons in the case of nuclei), the latter can, during the collision, redistribute 
themselves amongst two or several final composite particles which are different 
from the initial particles; in this case, one speaks of “rearrangement collisions”. 


* In practice, it is not always possible to detect all the particles emitted, and one must often be 
satisfied with partial information about the final system. 
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Moreover, at high energies, the relativistic possibility of the “materialization” of 
part of the energy appears: new particles are then created and the final state can 
include a great number of them (the higher the energy of the incident beam, the 
greater the number). Broadly speaking, one says that collisions give rise to reactions, 
which are described most often as in chemistry : 


dg) + D ~ @+@+6)+.. (A-1) 


Amongst all the reactions possible* under given conditions, scattering reactions 
are defined as those in which the final state and the initial state are composed of 
the same particles (1) and (2). In addition, a scattering reaction is said to be elastic 
when none of the particles’ internal states change during the collision. 


2. Scattering by a potential 


We shall confine ourselves in this chapter to the study of the elastic scattering 
of the incident particles (1) by the target particles (2). If the laws of classical mecha- 
nics were applicable, solving this problem would involve determining the deviations 
in the incident particles’ trajectories due to the forces exerted by particles (2). For 
processes occurring on an atomic or nuclear scale, it is clearly out of the question to 
use classical mechanics to resolve the problem; we must study the evolution of the 
wave function associated with the incident particles under the influence of their 
interactions with the target particles [which is why we speak of the “scattering” of 
particles (1) by particles (2)]. Rather than attack this question in its most general 
form, we shall introduce the following simplifying hypotheses : 


(1) We shall suppose that particles (1) and (2) have no spin. This simplifies the 
theory considerably but should not be taken to imply that the spin of particles is 
unimportant in scattering phenomena. 

(ii) We shall not take into account the possible internal structure of particles (1) 
and (2). The following arguments are therefore not applicable to “inelastic” 
scattering phenomena, where part of the kinetic energy of (1) is absorbed in the 
final state by the internal degrees of freedom of (1) and (2) (cf. for example, the 
experiment of Franck and Hertz). We shall confine ourselves to the case of elastic 
scattering, which does not affect the internal structure of the particles. 

(iii) We shall assume that the target is thin enough to enable us to neglect 
multiple scattering processes; that is, processes during which a particular incident 
particle is scattered several times before leaving the target. 

(iv) We shall neglect any possibility of coherence between the waves scattered 
by the different particles which make up the target. This simplification is justified 
when the spread of the wave packets associated with particles (1) is small compared 
to the average distance between particles (2). Therefore we shall concern ourselves 
only with the elementary process of the scattering of a particle (1) of the beam by 
a particle (2) of the target. This excludes a certain number of phenomena which 


* Since the processes studied occur on a quantum level, it is not generally possible to predict 
with certainty what final state will result from a given collision; one merely attempts to predict the 
probabilities of the various possible states. 
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are nevertheless very interesting, such as coherent scattering by a crystal (Bragg 
diffraction) or scattering of slow neutrons by the phonons of a solid, which provide 
valuable information about the structure and dynamics of crystal lattices. When 
these coherence effects can be neglected, the flux of particles detected is simply the 
sum of the fluxes scattered by each of the M target particles, that is, ./” times the 
flux scattered by any one of them (the exact position of the scattering particle inside 
the target is unimportant since the target dimensions are much smaller than the 
distance between the target and the detector). 

(v) We shall assume that the interactions between particles (1) and (2) can be 
described by a potential energy V(r, — r,), which depends only on the relative 
position r = r, — r, of the particles. If we follow the reasoning of $B, chapter VII, 
then, in the center-of-mass reference frame* of the two particles (1) and (2), the 
problem reduces to the study of the scattering of a single particle by the potential V(r). 
The mass u of this “relative particle” is related to the masses m, and m, of (1) and (2) 
by the formula : 


(A-2) 


3. Definition of the scattering cross section 


Let Oz be the direction of the incident particles of mass y (fig. 2). The poten- 
tial V(r) is localized around the origin O of the coordinate system [which is in fact 
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FIGURE 2 


The incident beam, whose flux of particles is F,, is parallel to the axis Oz; it is assumed to be 
much wider than the zone of influence of the potential V(r), which is centered at O. Far from 
this zone of influence, a detector D measures the number dn of particles scattered per unit 
time into the solid angle dQ, centered around the direction defined by the polar angles 6 and ọ. 
The number dv is proportional to F, and to dQ; the coefficient of proportionality o(6, p) is, 
by definition, the scattering “cross section” in the direction (0, ¿p). 


* In order to interpret the results obtained in scattering experiments, it is clearly necessary 
to return to the laboratory reference frame. Going from one frame of reference to another is a simple 
kinematic problem that we will not consider here. See for example Messiah (1.17). vol. I. chap. X. § 7. 
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the center of mass of the two real particles (1) and (2)]. We shall designate by F, 
the flux of particles in the incident beam, that is, the number of particles per unit 
time which traverse a unit surface perpendicular to Oz in the region where z takes on 
very large negative values. (The flux F; is assumed to be weak enough to allow us 
to neglect interactions between different particles of the incident beam.) 

We place a detector far from the region under the influence of the potential and 
in the direction fixed by the polar angles @ and ọ, with an opening facing O and 
subtending the solid angle dQ (the detector is situated at a distance from O which 
is large compared to the linear dimensions of the potential’s zone of influence). 
We can thus count the number dn of particles scattered per unit time into the solid 
angle dQ about the direction (6, @). 

dn is obviously proportional to dQ and to the incident flux F;. We shall define 
a(0, p) to be the coefficient of proportionality between dn and F, dQ: 


| dn = F; o(6, 9) dQ | (A-3) 


The dimensions of dn and F, are, respectively, T~! and (LTY +. (0, p) therefore 
has the dimensions of a surface; it is called the differential scattering cross 
section in the direction (8, @). Cross sections are frequently measured in barns and 
submultiples of barns : 


l barn = 10724 cm? (A-4) 


The definition (A-3) can be interpreted in the following way: the number of 
particles per unit time which reach the detector is equal to the number of particles 
which would cross a surface o(@, p) dQ placed perpendicular to Oz in the incident 
beam. 

Similarly, the total scattering cross section o is defined by the formula: 


o= foe. p) dQ (A-5) 


COMMENTS: 


(i) Definition (A-3), in which dn is proportional to dQ, implies that only the 
scattered particles are taken into consideration. The flux of these particles 
reaching a given detector D [of fixed surface and placed in the direction (0, p)] 
is inversely proportional to the square of the distance between D and O (this 
property is characteristic of a scattered flux). In practice, the incident beam 
is laterally bounded [although its width remains much larger than the extent 
of the zone of influence of V(r)], and the detector is placed outside its 
trajectory so that it receives only the scattered particles. Of course, such an 
arrangement does not permit the measurement of the cross section in the 
direction 0 = 0 (the forward direction), which can only be obtained by 
extrapolation from the values of o(@, p) for small 8. 

(ii) The concept of a cross section is not limited to the case of elastic scattering: 
reaction cross sections are defined in an analogous manner. 
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4. Organization of this chapter 


§B is devoted to a brief study of scattering by an arbitrary potential V(r) 
(decreasing however faster than 1/r as r tends toward infinity). First of all, in $B-1, 
we introduce the fundamental concepts of a stationary scattering state and 
a scattering amplitude. We then show, in §B-2, how knowledge of the asymptotic 
behavior of the wave functions associated with stationary scattering states enables 
us to obtain scattering cross sections. Afterwards, in §B-3, we discuss in a more 
precise way, using the integral scattering equation, the existence of these stationary 
scattering states. Finally (in §B-4), we derive an approximate solution of this 
equation, valid for weak potentials. This leads us to the Born approximation, in 
which the cross section is very simply related to the Fourier transform of the potential. 

For a central potential V(r), the general methods described in §B clearly 
remain applicable, but the method of partial waves, set forth in §C, is usually consi- 
dered preferable. This method is based (§C-1) on the comparison of the stationary 
states with well-defined angular momentum in the presence of the potential V(r) 
(which we shall call “partial waves”) and their analogues in the absence of the 
potential (“free spherical waves”). Therefore, we begin by studying, in §C-2, the 
essential properties of the stationary states of a free particle, and more particularly 
those of free spherical waves. Afterwards (§C-3), we show that the difference bet- 
ween a partial wave in the potential V(r) and a free spherical wave with the same 
angular momentum / is characterized by a “ phase shift” 6,. Thus, it is only necessary 
to know how stationary scattering states can be constructed from partial waves 
in order to obtain the expression of cross sections in terms of phase shifts (§C-4). 


B. STATIONARY SCATTERING STATES. 
CALCULATION OF THE CROSS SECTION 


In order to describe in quantum mechanical terms the scattering of a given 
incident particle by the potential V(r), it is necessary to study the time evolution 
of the wave packet representing the state of the particle. The characteristics of this 
wave packet are assumed to be known for large negative values of the time f, when 
the particle is in the negative region of the Oz axis, far from and not yet affected by 
the potential V(r). [t is known that the subsequent evolution of the wave packet 
can be obtained immediately if it is expressed as a superposition of stationary states. 
This is why we are going to begin by studying the eigenvalue equation of the 
Hamiltonian : 


H = H, + Vir) (B-1) 
where: 
P? 
Ho = 2u (B-2) 


describes the particle’s kinetic energy. 
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Actually, to simplify the calculations, we are going to base our reasoning 
directly on the stationary states and not on wave packets. We have already used 
this procedure in chapter I, in the study of “square” one-dimensional potentials 
(§ D-2 and complement H,). It consists of considering a stationary state to represent 
a “probability fluid” in steady flow, and studying the structure of the corresponding 
probability currents. Naturally, this simplified reasoning is not rigorous: it remains 
to be shown that it leads to the same results as the correct treatment of the problem, 
which is based on wave packets. Assuming this will enable us to develop certain 
general ideas easily, without burying them in complicated calculations*. 


1. Definition of stationary scattering states 
a. EIGENVALUE EQUATION OF THE HAMILTONIAN 


Schrédinger’s equation describing the evolution of the particle in the 
potential V(r) is satisfied by solutions associated with a well-defined energy E 
(stationary states): 


Y (r, t) = e(r)e (B-3) 
where g(r) is a solution of the eigenvalue equation: 


k Pa + JS = E g(r) (B-4) 


We are going to assume that the potential V(r) decreases faster than 1/r as r 
approaches infinity. Notice that this hypothesis excludes the Coulomb potential, 
which demands special treatment; we shall not consider it here. 

We shall only be concerned with solutions of (B-4) associated with a positive 
energy E, equal to the kinetic energy of the incident particle before it reaches 
the zone of influence of the potential. Defining: 


putt (B-5) 
ve) = = Ut) (B-6) 


enables us to write (B-4) in the form: 
[4 + k? — U(r)lo(r) = 0 (B-7) 


For each value of k (that is, of the energy E), equation (B-7) can be satisfied by 
an infinite number of solutions (the positive eigenvalues of the Hamiltonian H are 


* The proof was given in complement J,, for a particular one-dimensional problem; we verified 
that the same results are obtained by calculating the probability current associated with a stationary 
scattering state or by studying the evolution of a wave packet describing a particle which undergoes 
a collision. 
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infinitely degenerate). As in “square” one-dimensional potential problems 
(cf. chap. I, §D-2 and complement H,), we must choose from amongst these 
solutions the one which corresponds to the physical problem being studied (for 
example, when we wanted to determine the probability that a particle with a given 
energy would cross a one-dimensional potential barrier, we chose the stationary 
state which, in the region on the other side of the barrier, was composed simply of 
a transmitted wave). Here, the choice proves to be more complicated, since the 
particle is moving in three-dimensional space and the potential V(r) has, a priori, 
an arbitrary form. Therefore, we shall specify, using wave packet properties 
in an intuitive way, the conditions that must be imposed on the solutions of 
equation (B-7) if they are to be used in the description of a scattering process. We 
shall call the eigenstates of the Hamiltonian which satisfy these conditions stationary 


scattering states, and we shall designate by v(“Mr) the associated wave functions. 


b. ASYMPTOTIC FORM OF STATIONARY SCATTERING STATES. 
SCATTERING AMPLITUDE 


For large negative values of ¢, the incident particle is free [ V(r) is practically 
zero when one is sufficiently far from the point O |, and its state is represented by 
a plane wave packet. Consequently, the stationary wave function that we are 
looking for must contain a term of the form e***, where k is the constant which 
appears in equation (B-7). When the wave packet reaches the region which is under 
the influence of the potential V(r), its structure is profoundly modified and its 
evolution complicated. Nevertheless, for large positive values of ¢, it has left this 
region and once more takes on a simple form: it is now split into a transmitted 
wave packet which continues to propagate along Oz in the positive direction (hence 
having the form e**) and a scattered wave packet. Consequently, the wave function 
vr), representing the stationary scattering state associated with a given energy 
E = h*k?/2u,will be obtained from the superposition of the plane wave e'** and a 
scattered wave (we are ignoring the problem of normalization). 

The structure of the scattered wave obviously depends on the potential V(r). 
Yet its asymptotic form (valid far from the zone of influence of the potential) is 
simple; reasoning by analogy with wave optics, we see that the scattered wave must 
present the following characteristics for large r: 


(i) In a given direction (0, p), its radial dependence is of the form e**/r. 
It is a divergent (or “outgoing”) wave which has the same energy as the incident 
wave. The factor 1/r results from the fact that there are three spatial dimensions : 
(4 + k?) e™ is not zero, while: 


e" 


(4 + k?) 7 


=0 for r > r, where r, is any positive number (B-8) 


(in optics, the factor 1/r insures that the total flux of energy passing through a sphere 
of radius r is independent of r for large r; in quantum mechanics, it is the probability 
flux passing through this sphere that does not depend on r). 

(ii) Since scattering is not generally isotropic, the amplitude of the outgoing 
wave depends on the direction (0, @) being considered. 
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Finally, the wave function v{4'"r) associated with the stationary scattering 
state is, by definition, the solution of equation (B-7) whose asymptotic behavior is 
of the form : 


| A p . eik 
| aim) elf + f(O, Ta (B-9) 


In this expression, only the function /,(0, @), which is called the scattering amplitude, 
depends on the potential V(r). It can be shown (cf. $B-3) that equation (B-7) has 
indeed one and only one solution, for each value of k, that satisfies condition (B-9). 


COMMENTS: 


(1) We have already pointed out that in order to obtain simply the time 
evolution of the wave packet representing the state of the incident particle, 
it is necessary to expand it in terms of eigenstates of the total Hamiltonian H 
rather than in terms of plane waves. Therefore, let us consider a wave function 
of the form*: 


Wir, 1) = | “dk gk) A) oH (B-10) 


where : 
h?k? 
k7 2u 


(B-11) 


and where the function g(k), taken to be real for the sake of simplicity, has 
a pronounced peak at k = ky and practically vanishes elsewhere. y(r, £) is 
a solution of Schrödinger’s equation and therefore correctly describes the 
time evolution of the particle. It remains to be shown that this function indeed 
satisfies the boundary conditions imposed by the particular physical problem 
being considered. According to (B-9), it approaches asymptotically the sum of 
a plane wave packet and a scattered wave packet : 


A 


Wr, t) >, | dlegk) e" = ah +| “dk gk) A0, p) 
o 0 


etk" 


er iExt/h (B-12) 


r 


The position of the maximum of each of these packets can be obtained from 
the stationary phase condition (cf. chap. I, $C-2). A simple calculation then 
gives for the plane wave packet: 


Zult) = vgt (B-13) 
with: 
Ug = — (B-14) 


* Actually, it is also necessary to superpose the plane waves corresponding to wave vectors k 
having slightly different orientations, for the incident wave packet is limited in the direction perpendicular 
to Oz. For the sake of simplicity, we are concerning ourselves here only with the energy dispersion 
(which limits the spread of the wave packet along Oz). 
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As for the scattered wave packet, its maximum in the direction (8, p) is located 
at a distance from the point O given by: 


ruy. pt) = — a (0, p) + vgt (B-15) 


where «,(0, @) is the derivative with respect to k of the argument of the 
scattering amplitude f,(0, @). Note that formulas (B-13) and (B-15) are valid 
only in the asymptotic region (that is, for large |z|). 

For large negative values of t, there is no scattered wave packet, as can 
be seen from (B-15). The waves of which it is composed interfere constructively 
only for negative values of r, and these values lie outside the domain permitted 
to r. Therefore, all that we find in this region is the plane wave packet, which, 
according to (B-13), is making its way towards the interaction region with 
a group velocity v¿. For large positive values of ¢, both packets are actually 
present; the first one moves off along the positive Oz axis, continuing along 
the path of the incident packet, and the second one diverges in all directions. 
The scattering process can thus be well described by the asymptotic 
condition (B-9). 


The spatial extension Az of the wave packet (B-10) is related to the momentum disper 
sion hAk by the relation: 


| 
Az =~ — (B-16) 


Ak 
We shall assume that Ak is small enough for Az to be much larger than the linear dimen- 
sions of the potential’s zone of influence. Under these conditions, the wave packet moving 
at a velocity vg towards the point O (fig. 3) will take a time : 


are (B-17) 
Dg Ug Ak 

a 

a ot =!) 

| we 

” 
au” 
Zone of influence 
of the potential 
dd — — ———- Az — ————— > 


FIGURE 3 


The incident wave packet of length 4z moves at a velocity vç towards the potential V(r); it interacts 
with the potential during a time of the order of AT = 4z/v¿ (assuming the size of the potential’s 
zone of influence to be negligible compared to 42). 
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to cross this zone. Let us fix the time origin at the instant when the center of the incident 
wave packet reaches point O. Scattered waves exist only for 1 2 — AT/2, i.e., after the 
forward edge of the incident wave packet has arrived at the potential's zone of influence. 
For ¢ = 0, the most distant part of the scattered wave packet is at a distance of the 
order of Az/2 from the point O. 

Let us now consider an a priori different problem, where we have a time-dependent 
potential, obtained by multiplying V(r) by a function f(r) which increases slowly 
from 0 to 1 between t = — 47/2 and t = 0. For t much less than — AT/2. the potential 
is zero and we shall assume that the state of the particle is represented by a plane 
wave (extending throughout all space). This plane wave begins to be modified only 
at t ~ — 4T/2, and at the instant ¢ = 0 the scattered waves look like those in the 
preceding case. 

Thus we see that there is a certain similarity between the two different problems 
that we have just described. On the one hand, we have scattering by a constant potential 
of an incident wave packet whose amplitude at the point O increases smoothly between 
the times — AT/2 and zero; on the other hand, we have scattering of a plane wave of 
constant amplitude by a potential that is gradually “turned on” over the same time 
interval [ — 47/2, 0]. ' 

If Ak —= 0, the wave packet (B-10) tends toward a stationary scattering state 
[g(k) tends toward d(k — kg)]; in addition, according to.(B-17), AT becomes infinite and 
the turning on of the potential associated with the function f(r) becomes infinitely slow 
(for this reason, it is often said to be “adiabatic”). The preceding discussion, although 
very qualitative, thus makes it possible to describe a stationary scattering state as the 
result of adiabatically imposing a scattering potential on a free plane wave. We could 
make this interpretation more precise by studying in a more detailed way the evolution 
of the initial plane wave under the influence of the potential f (1)V (r). 


2. Calculation of the cross section using probability currents 
a. PROBABILITY FLUID ASSOCIATED WITH A STATIONARY SCATTERING STATE 


In order to determine the cross section, one should study the scattering of an 
incident wave packet by the potential V(r). However, we can obtain the result much 
more simply by basing our reasoning on the stationary scattering states; we consider 
such a state to describe a probability fluid in steady flow and we calculate the cross 
section from the incident and scattered currents. As we have already pointed out, 
this method is analogous to the one we used in one-dimensional “square” barrier 
problems : in those problems, the ratio between the reflected (or transmitted) cur- 
rent and the incident current yielded the reflection (or transmission) coefficient 
directly. 

Hence we shall calculate the contributions of the incident wave and the scat- 
tered wave to the probability current in a stationary scattering state. We recall that 
the expression for the current J(r) associated with a wave function g(r) is : 


Hie) = 4 Re | ore Evan | (B-18) 
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b. INCIDENT CURRENT AND SCATTERED CURRENT 


The incident current J, is obtained from (B-18) by replacing ¢(r) by the plane 
wave e'*; J, is therefore directed along the Oz axis in the positive direction, and 
its modulus is: 


|J; = yi (B-19) 


Since the scattered wave is expressed in spherical coordinates in formula (B-9), 
we shall calculate the components of the scattered current J, along the local axes 
defined by this coordinate system. Recall that the corresponding components of 
the operator V are: 


ĉ 


(Y), A 
16 
(V)o = 7 30 
1 0 


(Y), = rand 66 (B-20) 


If we replace g(r) in formula (B-18) by the function /,(6, p)e*"/r, we can easily 
obtain the scattered current in the asymptotic region: 


h 1 1. ô 
(0 = Refi sre o fle, 0) 


(Sao = Toe E FEO, D flO, J (B-21) 


u r? sin ð ! 


Since r is large, (Jz), and (J,), are negligible compared to (J,),, and the scattered 
current is practically radial. 


c. EXPRESSION FOR THE CROSS SECTION 


The incident beam is composed of independent particles, all of which are 
assumed to be prepared in the same way. Sending a great number. of these par- 
ticles amounts to repeating the same experiment a great number of times with 
one particle whose state is always the same. If this state is v{¢"(r), it is clear that 
the incident flux F; (that is, the number of particles of the incident beam which cross 
a unit surface perpendicular to Oz per unit time) is proportional to the flux of the 
vector J, across this surface; that is, according to (B-19): 

F; = C |J; =; c” (B-22) 
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Similarly, the number dn of particles which strike the opening of the detector 
(fig. 2) per unit time is proportional to the flux of the vector J, across the surface dS 
of this opening [the proportionality constant C is the same as in (B-22)]: 


dn = C J,.dS = C(J,),? da 


hk 
= C7*|A(0. o)? 40 (B-23) 
We see that dn is independent of r if r is sufficiently large. 


If we substitute formulas (B-22) and (B-23) into the definition (A-3) of the 
differential cross section a(0, @), we obtain: 


6 (0,9) = [AO p)? (B-24) 


J 


The differential cross section is thus simply the square of the modulus of the 
scattering amplitude. 


d. INTERFERENCE BETWEEN THE INCIDENT AND THE SCATTERED WAVES 


In the preceding sections, we have neglected a contribution to the current associated 
with vi (r) in the asymptotic region: the one which arises from interference between the plane 
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FIGURE 4 


Before the collision (fig. a), the incident wave packet is moving towards the zone of influence of the 
potential. After the collision (fig. b), we observe a plane wave packet and a spherical wave packet 
scattered by the potential (dashed lines in the figure). The plane and scattered waves interfere 
in the forward direction in a destructive way (conservation of total probability); the detector D 
is placed in a lateral direction and can only see the scattered waves. 


914 


www.elsolucionario.net 
B. STATIONARY SCATTERING STATES 


wave e'“? and the scattered wave, and which is obtained by replacing g*(r) in (B-18) by e” ** 


and g(r) by f,(@. p) e*"/r, and vice versa. 

Nevertheless, we can convince ourselves that these interference terms do not appear 
when we are concerned with scattering in directions other than the forward direction (0 = 0). 
In order to do so, let us go back to the description of the collision in terms of wave packets (fig. 4), 
and let us take into consideration the fact that in practice the wave packet always has a finite 
lateral spread. Initially, the incident wave packet is moving towards the zone of influence 
of V(r) (fig. 4-a). After the collision (fig. 4-b), we find two wave packets : a plane one which 
results from the propagation of the incident wave packet (as if there were no scattering potential) 
and a scattered one moving away from the point O in all directions. The transmitted wave thus 
results from the interference between these two wave packets. In general, however, we place 
the detector D outside the beam, so that it is not struck by transmitted particles; thus we 
observe only the scattered wave packet and it is not necessary to take into consideration the 
interference terms that we have just mentioned. 

Yet it follows from figure 4-b that interference between the plane and scattered wave 
packets cannot be neglected in the forward direction, where they occupy the same region of 
space. The transmitted wave packet results from this interference. It must have a smaller 
amplitude than the incident packet because of conservation of total probability (that is, conser- 
vation of the number of particles: particles scattered in all directions of space other than the 
forward direction leave the beam, whose intensity is thus attenuated after it has passed the 
target). It is thus the destructive interference between the plane and forward-scattered wave 
packets that insures the global conservation of the total number of particles. 


3. Integral scattering equation 


We propose to show now, in a more precise way than in $ B-1-b, how one can 
demonstrate the existence of stationary wave functions whose asymptotic behavior 
is of the form (B-9). In order to do so, we shall introduce the integral scattering 
equation, whose solutions are precisely these stationary scattering state wave 
functions. 

Let us go back to the eigenvalue equation of H [formula (B-7)] and put it in 
the form: 


(4 + k*) g(r) = U(r) lr) (B-25) 


Suppose (we shall see later that this is in fact the case) that there exists 
a function G(r) such that : 


(A + k?) G(r) = òlr) (B-26) 


[G(r) is called the “Green’s function” of the operator 4 + k?]. Then any 
function g(r) which satisfies: 


olr) = paolr) + [eror — r') U(r’) g(r’) (B-27) 
where ¢,(r) is a solution of the homogeneous equation: 


(4 + A) polr) = 0 (B-28) 
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obeys the differential equation (B-25). To show this, we apply the operator 4 + k? 
to both sides of equation (B-27); taking (B-28) into account, we obtain: 


(A + k’) g(r) = (4 +k?) Jara — r) U(r’) olr’) (B-29) 


Assuming we can move the operator inside the integral, it will act only on the 
variable r, and we shall have, according to (B-26): 


(4 +k) olt) = [erat — 1) Ut) ole’) 


U(r) olr) (B-30) 


Inversely, it can be shown that any solution of (B-25) satisfies (B-27)*. The 
differential equation (B-25) can thus be replaced by the integral equation (B-27). 

We shall see that it is often easier to base our reasoning on the integral equation. 
Its principal advantage derives from the fact that by choosing olr) and G(r) 
correctly, one can incorporate into the equation the desired asymptotic behavior. 
Thus, one single integral equation, called the integral scattering equation, becomes 
the equivalent of the differential equation (B-25) and the asymptotic condition (B-9). 

To begin with, let us consider (B-26). It implies that (4 + k?)G(r) must be 
identically equal to zero in any region which does not include the origin [which, 
according to (B-8), js the case when G(r) is equal to e**"/+]. Moreover, according 
to formula (61) of appendix II, G(r) must behave like — 1/4zr when r approaches zero. 
In fact, it is easy to show that the functions : 


1 etikr 


as (B-31) 


G,(r) = “Ar r 


are solutions of equation (B-26). We may write: 


+ afv ( = 75)| [Vet] (B-32) 


A simple calculation then gives (cf. appendix II): 


AG, (t) = — k’°G, (r) + d(r) (B-33) 


which is what we wished to prove. G, and G_ are called, respectively “ outgoing and 
incoming Green’s functions”. 

The actual form of the desired asymptotic behavior (B-9) suggests the choice of 
the incident plane wave e“*? for p,(r) and the choice of the outgoing Green’s function 


* This can be seen intuitively if one considers U(r)p (r) to be the right-hand side of a differential 
equation : the general solution of (B-25) is then obtained by adding to the general solution of the 
homogeneous equation a particular solution of the complete equation [second term of (B-27)]. 
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G,(r) for G(r). In fact, we are going to show that the integral scattering equation 
can be written : 


aer) = et + [erce — r') U(r) aitor) (B-14) 


whose solutions present the asymptotic behaviour given by (B-9). 

To do this, let us place ourselves at a point M (position r), very far from the 
various points P (position r') of the zone of influence of the potential, whose linear 
dimensions are of the order of L* (fig. 5): 


r >L 
rs L (B-35) 


FIGURE 5 


Approximate calculation of the distance |r — r'| between 
a point Af very far from O and a point P situated in the 
zone of influence of the potential (the dimensions of this 
zone of influence are of the order of Z). 


Since the angle between MO and MP is very small, the length MP (that is, |r — r'|) 
is equal, to within a good approximation, to the projection of MP on MO: 


~=r-u.r (B-36) 


r-r 
where u is the unit vector in the r direction. It follows that, for large r: 


1 ik]r -r'| 1 tke ; ; 
e => ek ur (B-37) 


hne- r| 4r 


G,(r —r') = 


* Recall that we have explicitly assumed that U(r) decreases at infinity faster than | r. 
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Substituting this expression back into equation (B-34), we obtain the asymptotic 
behavior of v{4i"(r): 


7 A 1 eikr iii z N , 
er) pao. et — AA E arr eT e U(r) g(r’) (B-38) 


which is indeed of the form (B-9), since the integral is no longer a function of the 
distance r = OM but only (through the unit vector u) of the polar angles 0 and gy 
which fix the direction of the vector OM. Thus, by setting : 


A0, p) = — = fer eT% er U(r’) giar) (B-39) 


we are led to an expression which is identical to (B-9). 

It is therefore clear that the solutions of the integral scattering equation (B-34) 
are indeed the stationary scattering states *. 

COMMENT: 


It is often convenient to define the incident wave vector k, as a vector of 
modulus k directed along the Oz axis of the beam such that : 


ez — eiker (B-40) 


In the same way, the vector k, which has the same modulus k as the incident 
wave vector but whose direction is fixed by the angles 0 and @ is called the 
scattered wave vector in the direction (0, q): 


k,=ku (B-41) 


Finally, the scattering (or transferred) wave vector in the direction (0, @) is the 
difference between k, and k; (fig. 6): 


K = k, — k; (B-42) 
> 
k 7 K 
FIGURE 6 
y A ee Incident wave vector k,, scattered wave vector k, 
k, and transferred wave vector K. 


* In order to prove the existence of stationary scattering states rigorously, it would thus be 
sufficient to demonstrate that equation (B-34) admits a solution. 
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4, The Born approximation 


a. APPROXIMATE SOLUTION 
OF THE INTEGRAL SCATTERING EQUATION 


If we take (B-40) into account, we can write the integral scattering equation 
in the form: 


pip) = eiker + [erot — U(r’) sir) (B-43) 


We are going to try to solve this equation by iteration. 
A simple change of notation (r => r' ; r' => r”) permits us to write: 


raiO) = eiker 4 [erot — r”) U(r”) ver") (B-44) 
Inserting this expression in (B-43), we obtain: 
¿A = eter + [erci — r) U(r) eit" 
+ fer [erca — r) U(r) G(r — r“) U(r") etr") (B-45) 


The first two terms on the right-hand side of (B-45) are known; only the third one 
contains the unknown function t4M(r). This procedure can be repeated : changing r 
to r” andr’ tor” in (B-43) gives v{4'"(r"), which can be reinserted in (B-45). We then 
have: 


rir) = er + [arc — r) U(r) et" 
+ fer [aras — r) U(r) G(r — r”) U(r’) ker’ 


+ fer [er fora. — r) U(r) G(r — r“) U(r’) 
x G(r — r”) U(r”) esir”) (B-46) 


where the first three terms are known; the unknown function or) has been 
pushed back into the fourth term. 

Thus we can construct, step by step, what is called the Born expansion of the 
stationary scattering wave function. Note that each term of this expansion brings 
in One higher power of the potential than the preceding one. Thus, if the potential 
is weak, each successive term is smaller than the preceding one. If we push the 
expansion far enough, we can neglect the last term on the right-hand side and thus 
obtain v(“iM(r) entirely in terms of known quantities. 
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If we substitute this expansion of v{'(r) into expression (B-39), we obtain 
the Born expansion of the scattering amplitude. In particular, if we limit ourselves 
to first order in U, all we need to do is replace v@'(r’) by e'**"’ on the right-hand 
side of (B-39). This is the Born approximation: 


f P10, p) = x [er e ik ur U(r’) eiker 


A 
4r 


dy e7 kak’ (y) 
2 ghe eK U(r’) (B-47) 
4r 


where K is the scattering wave vector defined in (B-42). The scattering cross section, 
in the Born approximation, is thus very simply related to the Fourier transform of 
the potential, since, using (B-24) and (B-6), (B-47) implies : 

2 


(B) pe 
oa, (8, p) = Anh 


(B-48) 


{er e" Kry(r) 


According to figure 6, the direction and modulus of the scattering wave 
vector K depend both on the modulus k of k; and k, and on the scattering 
direction (0, p). Thus, for a given 0 and q, the Born cross section varies with k, 
that is, with the energy of the incident beam. Similarly, for a given energy, 
co?) varies with 0 and y. We thus see, within the simple framework of the Born 
approximation, how studying the variation of the differential cross section in 
terms of the scattering direction and the incident energy gives us information about 
the potential V(r). 


b. INTERPRETATION OF THE FORMULAS 


We can give formula (B-45) a physical interpretation which brings out very 
clearly the formal analogy between quantum mechanics and wave optics. 

Let us consider the zóne of influence of the potential to be a scattering medium 
whose density is proportional to U(r). The function G,(r — r') [formula (B-31)] 
represents the amplitude at the point r of a wave radiated by a point source situated 
at r’. Consequently, the first two terms of formula (B-45) describe the total wave 
at the point r as the result of the superposition of the incident wave e'**" and an infi- 
nite number of waves coming from secondary sources induced in the scattering 
medium by the incident wave. The amplitude of each of these sources is indeed 
proportional to the incident wave (e'***") and the density of the scattering mate- 
rial [ U(r’)], evaluated at the corresponding point r’. This interpretation, symbolized 
by figure 7. recalls Huygens’ principle in wave optics. 

Actually, formula (B-45) includes a third term. However, we can interpret in 
an analogous fashion the successive terms of the Born expansion. Since the 
scattering medium extends over a certain area, a given secondary source is excited 
not only by the incident wave but also by scattered waves coming from other 
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FIGURE 7 


Schematic representation of the Born 
approximation: we only consider the 
incident wave and the waves scattered 
by one interaction with the potential. 


secondary sources. Figure 8 represents symbolically the third term of the Born 
expansion [cf. formula (B-46)]. If the scattering medium has a very low 
density [U(r) very small], we can neglect the influence of secondary sources on 
each other. 


AA ISS 
FIGURE 8 

A 
Schematic representation of the 
second-order term in U in the Born 
expansion: here we consider waves 
which are scattered twice by the poten- 
tial. 

COMMENT: 


The interpretation that we have just given for higher-order terms in the 
Born expansion has nothing to do with the multiple scattering processes 
that can occur inside a thick target: we are only concerned, here, with 
the scattering of one particle of the beam by a single particle of the target, 
while multiple scattering brings in the successive interactions of the same 
incident particle with several different particles of the target. 


C. SCATTERING BY A CENTRAL POTENTIAL. 
METHOD OF PARTIAL WAVES 


1. Principle of the method of partial waves 


In the special case of a central potential V(r), the orbital angular momen- 
tum L of the particle is a constant of the motion. Therefore, there exist stationary 
states with well-defined angular momentum: that is, eigenstates common to H, 
L and L,. We shall call the wave functions associated with these states partial 
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waves and we shall write them q, ,,,(r). The corresponding eigenvalues of H, 
L? and L, are, respectively, h?k?/2u, (1 + 1)h? and mh. Their angular dependence is 
always given by the spherical harmonics Y7'(0, ); the potential V(r) influences 
only their radial dependence. 

We expect that, for large r, the partial waves will be very close to the common 
eigenfunctions of Ha, L? and L., where H, is the free Hamiltonian [formula (B-2)]. 
This is why we are first going to study, in §C-2, the stationary states of a free particle, 
and, in particular, those which have a well-defined angular momentum. The corres- 
ponding wave functions p(?,,(r) are free spherical waves : their angular dependence 
is, Of course, that of a spherical harmonic and we shall see that the asymptotic 
expression for their radial function is the superposition of an incoming wave e *"/r 
and an outgoing wave e**"/r with a well-determined phase difference. 

The asymptotic expression for the partial wave Q, imr) in the potential V(r) 
is also ($C-3) the superposition of an incoming wave and an outgoing wave. 
However. the phase difference between these two waves is different from the one 
which characterizes the corresponding free spherical wave: the potential V(r) 
introduces a supplementary phase shift ò. This phase shift constitutes the only 
difference between the asymptotic behavior of P, im and that of {°} „- Conse- 
quently, for fixed A, the phase shilts 0, for all values of / are all we need to know 
to be able to calculate the cross section. 

In order to carry out this calculation, we shall express (§C-4) the stationary 
scattering state Kir) as a linear combination of partial waves ~, , (Fr) having the 
same energy but different angular momenta. Simple physical arguments suggest 
that the coefficients of this linear combination should be the same as those of the 
free spherical wave expansion of the plane wave e™ ; this is in fact confirmed by an 
explicit calculation. 

The use of partial waves thus permits us to express the scattering amplitude. 
and hence the cross section, in terms of the phase shifts 6,. This method is parti- 
cularly attractive when the range of the potential is not moch longer than the wave- 
length associated with the particle’s motion, for, in this case, only a small number of 
phase shifts are involved ($C-3-b-B). 


2. Stationary states of a free particle 


In classical mechanics, a free particle of mass u moves along a uniform linear 
trajectory. Its momentum p, its energy E = p?/24 and its angular momentum 
XL =r x p relative to the origin of the coordinate system are constants of the 
motion. 

In quantum mechanics, the observables P and L = R x P do not commute. 
Hence they represent incompatible quantities: it is impossible to measure the 
momentum and the angular momentum of a particle simultaneously. 

The quantum mechanical Hamiltonian H, is written: 


H, = — P? (C-1) 
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Ho does not constitute by itself a C.S.C.O.: its eigenvalues are infinitely dege- 
nerate ($2-a). On the other hand, the four observables : 


Ho. ae ae P, (C-2) 


form a C.S.C.O. Their common eigenstates are stationary states of well defined 
momentum. A free particle may also be considered as being placed in a zero central 
potential. The results of chap. VII then indicate that the three observables : 


Ha. L?, L (C-3) 


form a C.S.C.O. The corresponding eigenstates are stationay states with well- 
defined angular momentum (more precisely, L? and L, have well-defined values, 
but L, and L, do not). 

The bases of the state space defined by the C.S.C.O.'s (C-2) and (C-3) are 
distinct, since P and L are incompatible quantities. We are going to study these 
two bases and show how one can pass from one to the other. 


a. STATIONARY STATES WITH WELL-DEFINED MOMENTUM. 
PLANE WAVES 


We already know (c/. chap. 11, E-2-d) that the three observables P,, P, and P, 
form a C.S.C.O. (for a spinless particle). Their common eigenstates form a basis 
for the [ | p > ) representation: 


P|p>=p\|p> (C-4) 


Since H, commutes with these three observables, the states | p > are necessarily 
eigenstates of Ho: 


Ho| p> =3;]P> (C-5) 


The spectrum of H, is therefore continuous and includes all positive numbers 
and zero. Each of these eigenvalues is infinitely degenerate: to a fixed positive 
energy E there corresponds an infinite number of kets |p > since there exists an 
infinite number of ordinary vectors p whose modulus satisfies : 


pl = V2uE (C-6) 


The wave functions associated with the kets |p> are the plane waves 
(cf. chap. I, SE-1-a): 


Go = (zg) e (C-7) 


We shall introduce here the wave vector k to characterize a plane wave : 


p 
k=} (C-8) 
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and we shall define: 
|k > =(")*" | p> (C-9) 


The kets | k > are stationary states with well-defined momentum: 


h7k? 
Ho] k > => ko (C-10-a) 
P|k> =”k|k> (C-10-b) 
They are orthonormal in the extended sense: 
<k|k’> = ó(k — k’) (C-11) 


and form a basis in the state space: 
DIGAS (C-12) 


The associated wave functions are the plane waves normalized, in a slightly 
different way : 


1 3/2 
<r|k> = (2) eres (C-13) 


b. STATIONARY STATES WITH WELL-DEFINED ANGULAR MOMENTUM. 
FREE SPHERICAL WAVES 


In order to obtain the eigenfunctions common to Hoy, L? and L,, all we have 
to do is solve the radial equation for an identically zero central potential. The 
detailed solution of this problem is given in complement Ayıq; we shall be satisfied 
here with giving the results. 

Free spherical waves are the wave functions associated with the well-defined 
angular momentum stationary states | o{°),, > of a free particle; they are written : 


ja 
Ole) = I ihr) YFG, 6) (C-14) 


where j, is a spherical Bessel function defined by: 


i eT ¡(1 d Y sinp . 
ido) = ( (Sas) : (C-15) 


The corresponding eigenvalues of H,, L? and L, are, respectively, h?k?/2p, 
I(l + 1)? and mh. 
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The free spherical waves (C-14) are orthonormal in the extended sense : 


2 = eae 
L Olm | OP bm > = E KK | jlkr) (Kr) r? dr x f dO Y7*(6, p) Y7 (0, 9) 
0 
= Ó(k — k’) w Ömm (C-16) 


and form a basis in the state space: 


20 
0 0 — 
E E y | Plm > < Perm | = 1 (C-17) 
1=0 m= -i 
c. PHYSICAL PROPERTIES OF FREE SPHERICAL WAVES 


a. Angular dependence 

The angular dependence of the free spherical wave PK? mle) is entirely given 
by the spherical harmonic Y7 (0, o). It is thus fixed by the eigenvalues of L? and L, 
(that is, by the indices / and m) and not by the energy. For example, a free s(/ = 0) 
spherical wave is always isotropic. 


B. Behavior in the neighborhood of the origin 


Let us consider an infinitesimal solid angle dQ, about the direction (09, po); 


when the state of the particle is | p¿%,, >, the probability of finding the particle in 


this solid angle between r and r + dr is proportional to: 
r° ji (kr)| Y7(Oo, Po)? dr dQ, (C-18) 


It can be shown (complement Ayp $ 2-c-a) that when p approaches zero: 


l 


p 
iN?) „zo r F iT (C-19) 


This result (which the discussion of chapter VII, §A-2-c would lead us to expect) 
implies that the probability (C-18) behaves like r?'*? near the origin; hence, the 
larger / is, the more slowly it increases. 

The shape of the function p?j?(p) is shown in figure 9. We see that this 
function remains small as long as: 


p < VII +1) (C-20) 


We may thus assume that the probability (C-18) is practically zero for : 
r <+ VNF 1) (C-21) 


925 


www.elsolucionario.net 


CHAPTER VII SCATTERING BY A POTENTIAL 


FIGURE 9 


Graph of the function p?;2(p) giving the radial dependence of the probability of finding the particle 


in the state | {°}, >. At the origin, this function behaves like p?'*?; it remains practically zero 


as long as p < Vil + 1). 


This result is very important physically for it implies that a particle in the 
state | @{°?,, > is practically unaffected by what happens inside a sphere centered 


at O of radius: 


b(k) = iv Kl + 1) (C-22) 


We shall return to this point in $C-3-b-f. 


COMMENT: 


In classical mechanics, a free particle of momentum p and angular 
momentum .£ moves in a straight line whose distance b from the point O 
is given (fig. 10) by: 


b a (€-23) 


b is called the “collision” or “impact parameter” of the particle relative to O; 
the larger || is and the smaller the momentum (i.e. the energy), the larger b is. 


If | Z| is replaced by h Jil + 1) and |p| by fk in (C-23), we again find expres- 
sion (C-22) for b,(k), which can thus be interpreted semi-classically. 
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FIGURE 10 


Definition of the classical impact parameter b of 
a particle of momentum p and angular momen- 
tum & relative to O. 


Y. Asymptotic behavior 


It can be shown (complement Ay, $2-c-B) that as p approaches infinity : 
: 1. pp 
MP) oP (C-24) 


Consequently, the asymptotic behavior of the free spherical wave ~;”,,,, (I) is: 


E eT e3 it a 
(0) fp Rea, Es £ ym A A A 
Pkl" 9. P) x, J = YrO. 9) Sik; (C-25) 


At infinity, pí%,, therefore results from the superposition of an incoming 
wave e ir and an outgoing wave e™"/r, whose amplitudes differ by a phase 
difference equal to /x. 


COMMENT: 


Suppose that we construct a packet of free spherical waves, all correspon- 
ding to the same values of / and m. A line of reasoning analogous to that 
of comment (i) of $B-I-b may be applied to it. The following conclusion 
results: for large negative values of f, only an incoming wave packet exists; 
while for large positive values of ¢, only an outgoing wave packet exists. 
Therefore, a free spherical wave may be thought of in the following manner : 
at first we have an incoming wave converging towards O; it becomes 
distorted as it approaches this point, retraces its steps when it is at a distance 
of the order of b,(k) [formula (C-22)], and gives rise to an outgoing wave 
with a phase shift of /z. 
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d. EXPANSION OF A PLANE WAVE 
IN TERMS OF FREE SPHERICAL WAVES 


We thus have two distinct bases formed by eigenstates of H, : the { | k > } basis 
associated with the plane waves and the { | o{°),, > ) basis associated with the free 
spherical waves. It is possible to expand any ket of one basis in terms of vectors 
of the other one. 

Let us consider in particular the ket | 0, 0, k >, associated with a plane wave 
of wave vector k directed along Oz: 


1 3/2 
(r]0,0,k > = (2) elke (C-26) 


| O, 0, k > represents a state of well-defined energy and momentum (E = ħ?k?/2u ; 
p directed along Oz with modulus hk). Now: 


pliz Z efkr coso (C-27) 


E : ho. : 
is independent of y; since L, corresponds to 73 i the { |r > } representation, 
p 


the ket | 0, 0, k > is also an eigenvector of £,, with the eigenvalue zero: 
L, | 9, 0k>=0 (C-28) 


Using the closure relation (C-17), we can write: 
9% 00 +1 

10,0,k) = | wÈ S A <P 0,0, k) (C-29) 
o Somas 


Since | 0, 0, k > and | p/%,, > are two eigenstates of Ho, they are orthogonal if the 
corresponding eigenvalues are different; their scalar product is therefore propor- 
tional to 6(k’ — k). Similarly, they are both eigenstates of L, and their scalar 
product is proportional to 6, [cf. relation (C-28)]. Formula (C-29) therefore 
takes on the form: 


00 


[0,0,k> = Y Carl Oto (C-30) 
i 


=0 


The coefficients c, , can be calculated explicitly (complement Ayin, $3). Thus 
we obtain: 


m: 00 — 
el = SV 4(2l + 1) jkr) Y9(0) (C-31) 


1=0 


A state of well-defined linear momentum is therefore formed by the superposition of 
states corresponding to all possible angular momenta. 
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COMMENT: 
The spherical harmonic Y? (0) is proportional to the Legendre polynomial 
P,(cos 0) (complement Ay,, $ 2-e-a): 


Y?(@) = fe) P (cos 8) (C-32) 


V 4n 


Hence the expansion (C-31) is often written in the form: 


e" = 3 it (21 + 1) jkr) P,(cos 0) (C-33) 


t=0 


3. Partial waves in the potential V(r) 


We are now going to study the eigenfunctions common to H (the total 
Hamiltonian), L? and L,; that is, the partial waves 9, , m(t). 


a. RADIAL EQUATION. PHASE SHIFTS 
For any central potential V(r), the partial waves 9, , ,,(r) are of the form: 
m 1 m 
Prt) = Ry, kr) Y7'(0, 9) = F u(r) Y7'(8, p) (C-34) 


where u, (r) is the solution of the radial equation : 


h? d? (l + 1)h? hk? 
| - Aa a + at E E Ve) isd = — mlr) (C-35) 
2u dr 2ur 2H 
satisfying the condition at the origin: 
u, (0) = 9 (C-36) 


It is just as if we were dealing with a one-dimensional problem, where a 
particle of mass y is under the influence of the potential (fig. 11): 


Kl + 1972 


2 


Vig(r) = V(r) + yr forr > 0 


Vs (r) infinite forr <0 (C-37) 
For large r, equation (C-35) reduces to: 


E + e lun, 0 (C-38) 


dr? >o 
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ES The effective potential K y(r) is the 
/ (r) sum of the potential V(r) and the 

1 Vir 2 

ae centrifugal term ey i 

2er? 
whose general solution is: 

ur), y 40% + Berto (C-39) 


Since u, (r) must satisfy condition (C-36), the constants A and B cannot be arbitrary. 
In the equivalent one-dimensional problem [formulas (C-37)], equation (C-36) is 
related to the fact that the potential is infinite for negative r, and expression (C-39) 
represents the superposition of an “incident” plane wave e” * coming from the 
right (along the axis on which the fictitious particle being studied moves) and a 
“reflected” plane wave ec" propagating from left to right. Since there can be no 
“transmitted” wave [as V(r) is infinite on the negative part of the axis], the 
“reflected” current must be equal to the “incident” current. Thus we see that 
condition (C-36) implies that, in the asymptotic expression (C-39): 


14] = |B| (C-40) 
Consequently : 
Uy. lr) ~, |A] Le eit +7"! elt] (C-41) 


which can be written in the form: 
U (r) „z, Csin (kr — fi) (C-42) 
The real phase f, is completely determined by imposing continuity 
between (C-42) and the solution of (C-35) which goes to zero at the origin. In the 


case of an identically null potential V(r), we saw in $C-2-c-y that $, is equal to /7/2. 
It is convenient to take this value as a point of reference, that is, to write: 


u(r) ~~, C sin (xr = IS + à) (C-43) 
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The quantity ò defined in this way is called the phase shiji of the partial wave 
Pi. 1.mlt): it obviously depends on A, that is, on the energy. 


b. PHYSICAL MEANING OF PHASE SHIFTS 
A. Comparison between partial waves and free spherical waves 


Taking (C-34) and (C-43) into account, we may write the expression for the 
asymptotic behavior of ¢, , ,,(r) in the form: 


Bina. At} c sin (kr — lz/2 + 0)) 
kin) A, € 


ry } 


Y7(0. 9) 


-ikr i (th - ikr -i(t 6 
2 ~ CMO, p 22222 (i=) ae ot) 


(C-44) 
We see that the partial wave ¢, ,,,(f), like a free spherical wave [formula (C-25)], 
results from the superposition of an incoming wave and an outgoing wave. 

In order to develop the comparison between partial waves and free spherical 
waves in detail, we can modify the incoming wave of (C-44) so as to make it 
identical with the one in (C-25). To do this, we define a new partial wave $, ,,,(I) 
by multiplying ¢, ,,,,(r) by e (this global phase factor has no physical importance) 
and by choosing the constant C in such a way that: 


-ikr elln/2 = ef" e in/2 218, 


El 
2ikr (C-45) 


X e 
Pr.tmlT) Poy > Yro, p) 


This expression can then be interpreted in the following way (cf. the comment 
in $C-2-c-y): at the outset, we have the same incoming wave as in the case of a free 


particle (aside from the normalization constant Y 2k?/m). As this incoming wave 
approaches the zone of influence of the potential, it is more and more perturbed 
by this potential. When, after turning back, it is transformed into an outgoing wave, 
it has accumulated a phase shift of 26, relative to the free outgoing wave that 
would have resulted if the potential V(r) had been identically zero. The factor e?'? 
(which varies with / and A) thus finally summarizes the total effect of the potential 
on a particle of angular momentum /. 


COMMENT: 


Actually, the preceding discussion is only valid if we base our reasoning on a wave 
packet formed by superposing partial waves q, ,,,(r) with the same / and m, but slightly 
different k. For large negative values of r. we have only an incoming wave packet; it is 
the subsequent evolution of this wave packet directed towards the potential’s zone of 
influence that we have analyzed above. 

We could also adopt the point of view of comment (ii) of $B-1-b; that is, we 
could study the effect on a stationary free spherical wave of slowly “turning on” the 
potential V(r). The same type of reasoning would then demonstrate that the partial 
Wave 0; ; (1) can be obtained from a free spherical wave py?) ,,(r) by adiabatically turning 
on the potential V(r). 
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B.  Finite-range potentials 


Let us suppose that the potential V(r) has a finite range rọ ; that is, that : 
Vir) =0 for r>ro (C-46) 


We pointed out earlier (§C-2-c-f) that the free spherical wave ¢{°),,, scarcely 
penetrates a sphere centered at O of radius b,(k) [formula (C-22)]. Therefore, if we 
return to the interpretation of formula (C-45) that we have just given, we see 
that a potential satisfying (C-46) has virtually no effect on waves for which : 


blk) > ro (C-47) 


since the corresponding incoming wave turns back before reaching the zone of 
influence of V(r). Thus, for each value of the energy, there exists a critical 
value /,, of the angular momentum, which, according to (C-22), is given approxima- 
tely by: 


V lullu + 1) = kro (C-48) 


The phase shifts 6, are appreciable only for values of / less than or of the order 
of lm- 

The shorter the range of the potential and the lower the incident energy, the 
smaller /,,*. Therefore, it may happen that the only non-zero phase shifts are those 
corresponding to the first few partial waves: the s(/ = 0) wave at very low energy, 
followed by s and p waves for slightly greater energies, etc. 


4. Expression of the cross section in terms of phase shifts 


Phase shifts characterize the modifications, caused by the potential, of the 
asymptotic behavior of stationary states with well-defined angular momentum. 
Knowing them should therefore allow us to determine the cross section. In order 
to demonstrate this, all we must do is express the stationary scattering state v{dif{r) 
in terms of partial waves** and calculate the scattering amplitude in this way. 


a. CONSTRUCTION OF THE STATIONARY SCATTERING STATE 
FROM PARTIAL WAVES 


We must find a linear superposition of partial waves whose asymptotic 
behavior is of the form (B-9). Since the stationary scattering state is an eigenstate 
of the Hamiltonian H, the expansion of v{*'"\r) involves only partial waves having 


* J is of the order of kro, which is the ratio between the range r of the potential and the 
wavelength of the incident particle. 


** Tf there exist bound states of the particle in the potential V(r) (stationary states with negative 
energy), the system of partial waves does not constitute a basis of the state space; in order to form 
such a basis, it is necessary to add the wave functions of the bound states to the partial waves. 
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the same energy h*k?/2u. Note also that, in the case of a central potential V(r), the 
scattering problem we are studying is symmetrical with respect to rotation around 
the Oz axis defined by the incident beam. Consequently, the stationary scattering 
wave function v{4(r) is independent of the azimuthal angle 9, so that its expansion 
includes only partial waves for which m is zero. Finally, we have an expression 
of the form: 


Mr) = E c raolt) (C-49) 


1=0 


The problem thus consists of finding the coefficients c,. 


a. Intuitive argument 


When V(r) is identically zero, the function u(dM(r) reduces to the plane 
wave e™7, and the partial waves become free spherical waves p¿%,,(r). In this case, 
we already know the expansion (C-49) : it is given by formula (C-31). 

For non-zero V(r), v('(r) includes a diverging scattered wave as well as a 
plane wave. Furthermore, we have seen that $, , o(r) differs from {ù} ¿(r) in its 
asymptotic behavior only by the presence of the outgoing wave, which has the same 
radial dependence as the scattered wave. We should therefore expect that the 
coefficients c, of the expansion (C-49) will be the same as those in formula (C-31)*, 


that is: 


oir) = Y PV Aall + 1) Op o(t) (C-50) 
1=0 
COMMENT: 


We can also understand (C-50) in terms of the interpretation offered in comment (ii) 
of $B-1-b and the comment in $C-3-b-a. If we have a plane wave whose expansion is 
given by (C-31) and we turn on the potential V(r) adiabatically, the wave is transformed 
into a stationary scattering state: the left-hand side of (C-31) must then be replaced 
by v,ditn(r). In addition, each free spherical wave j,(kr)¥?(@) appearing on the right-hand 
side of (C-31) is transformed into a partial wave $, , o(r) when the potential is turned on. 
If we take into account the linearity of Schródinger”s equation, we finally obtain (C-50). 


B. Explicit derivation 


Let us now consider formula (C-50), which was suggested by a physical 
approach to the problem, and let us show that it does indeed supply the desired 
expansion. 

First of all, the right-hand side of (C-50) is a superposition of eigenstates 
of H having the same energy h?k?/2u; consequently, this superposition 
remains a stationary State. 


* Note that the expansion (C-31) brings in j,(kr)Y?(6), that is, the free spherical wave po 


divided by the normalization factor \/2k/n; this is why we defined Pr 1 mir) [formula (C-45)] from 
expression (C-25) divided by this same factor. 
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Therefore, all we must do is make sure that the asymptotic behavior of the 
sum (C-50) is indeed of type (B-9). In order to do this, we use (C-45): 


=0 


Im 
a 


hele ee ee 2 e21] (C-51) 


In order to bring out the asymptotic behavior of expansion (C-31), we write: 
edi — | + 2¡e'% sin ô, (C-52) 


and, regrouping the terms which are independent of 0,, we have: 


Y PAn + 1) Gp ~ — > i /4n(21 + 1) ¥%0) 


1=0 


—ikr ,iln/2 ikr ¿—ilm/2 ikr TE 
ee — ee EJ anh ag oog 
ANS AN AA de 3 piór 7 
x rn: Sk e'” sinó, | (C-53) 


Taking (C-25) and (C-31) into consideration, we recognize, in the first term of the 
right-hand side, the asymptotic expansion of the plane wave e“*, and we obtain 
finally : 


ikr 


x? V4n(21 +1) Bole) > e + 7,(0) £ (C-54) 
with* 

| Mp | 

Li) = i Y Anll + 1) e sin 6, ¥9(0) (C-55) 


i l 1=0 


We have thus demonstrated that the expansion of (C-50) is correct and have 
found at the same time the expression for the scattering amplitude /,(0) in terms 
of the phase shifts 6,. 


b. CALCULATION OF THE CROSS SECTION 


The differential scattering cross section is then given by formula (B-24): 


2 


= | A0] = ` y Varal 21 + 1) e* sin ò ¥0) 


pr 


(C-56) 


a 
2 


i 
* The factor i' is compensated by e "2 = (— i! = (+) 
I 
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from which we deduce the total scattering cross section by integrating over the 
angles: 

c= fao a(0) z Y 4r JQ DOr + 1) etrr) sin ô sin ðp 

TY 


x | dQ ¥°*(0) Y9(0) (C-57) 


Since the spherical harmonics are orthonormal [formula (D-23) of chapter VI], 
we have finally : 


E = + Y (21 + 1) sin? ò (C-58) 
2 1=0 


Thus the terms resulting from interference between waves of different angular 
momenta disappear from the total cross section. For any potential V(r), the 


ag AR bei ae 
contribution p~ + 1)sin* 6, associated with a given value of / is positive and 


. 4 
has an upper bound, for a given energy, of z (21 + 1). 


In theory. formulas (C-56) and (C-58) necessitate knowing all the phase 
shifts 6,. Recall (cf. $C-3-a) that these phase shifts can be calculated from the radial 
equation if the potential V(r) is known; this equation must be solved separately 
for each value of / (most of the time, moreover, this implies resorting to numerical 
techniques). In other words, the method of partial waves is attractive from a 
practical point of view only when there is a sufficiently small number of non-zero 
phase shifts. For a finite-range potential V(r), we saw in $C-3-b-B that the phase 
shifts 6, are negligible for / > /,,, the critical value /,, being defined by formula (C-48). 

When the potential V(r) is unknown at the outset, we attempt to reproduce 
the experimental curves which give the differential cross section at a fixed energy 
by introducing a small number of non-zero phase shifts. Furthermore, the very 
form of the -dependence of the cross section often suggests the minimum number 
of phase shifts needed. For example, if we limit ourselves to the s-wave, 
formula (C-56) gives an isotropic differential cross section (Y9 is a constant). Thus 
if the experiments in fact imply a variation of o(@) with 0, it means that phase 
shifts other than that of the s-wave are not equal to zero. Once we have thereby 
determined, from experimental results corresponding to different energies, the 
phase shifts which do effectively contribute to the cross section, we can look for 
theoretical models of potentials which produce these phase shifts and their energy 
dependence. 


COMMENT: 


The dependence of cross sections on the energy E = h?k?/2pu of the 
incident particle is just as interesting as the -dependence of o(). In particular, 
in certain cases, one observes rapid variations of the total cross section ¢ 
in the neighborhood of certain energy values. For example, if one of the 
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phase shifts ò takes on the value 7/2 for E = Ey, the corresponding contri- 
bution to a reaches its upper limit and the cross section may show a sharp 
peak at E = E,. This phenomenon is called “scattering resonance”. We can 
compare it to the behavior described in chapter I (§ D-2-c-B) of the transmission 
coefficient of a “square” one-dimensional potential well. 


References and suggestions for further reading: 


Dicke and Wittke (1.14), chap. 16; Messiah (1.17), chap. X ; Schiff (1.18), chaps. 5 
and 9. 


More advanced topics: 


Coulomb scattering: Messiah (1.17), chap. XI; Schiff (1.18), §21; Davydov (1.20), 
chap. XI, §100. 

Formal collision and S-matrix theory: Merzbacher (1.16), chap. 19; Roman (2.3), 
part II, chap. 4; Messiah (1.17), chap. XIX; Schweber (2.16), part 3, chap. 11. 

Description of collisions in terms of wave packets: Messiah (1.17), chap. X, 
§§ 4, 5, 6; Goldberger and Watson (2.4), chaps. 3 and 4. 

Determination of the potential from the phase shifts (the inverse problem): Wu 
and Ohmura (2.1), § G. 

Applications: Davydov (1.20), chap. XI; Sobel’man (11.12), chap. 11; Mott and 
Massey (2.5); Martin and Spearman (16.18). 

Scattering by multi-particle systems in the Born approximation and space-time 
correlation functions: Van Hove (2.39). 
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COMPLEMENTS OF CHAPTER VII 


Chapter VIII, at the upper limit of the program, is principally intended to serve as a reference 
for other courses, for example in nuclear physics. Physical applications of collision theory can be 


found in these courses. 


Aym: THE FREE PARTICLE : STATIONARY STATES 
WITH WELL-DEFINED ANGULAR 
MOMENTUM 


Bym: PHENOMENOLOGICAL DESCRIPTION 
OF COLLISIONS WITH ABSORPTION 


Cum: SOME SIMPLE APPLICATIONS 
OF SCATTERING THEORY 


Avi: formal examination of stationary wave 
functions for a free particle with well-defined 
angular momentum. The use of the L, and L_ 
Operators'permits the introduction of spherical 
Bessel functions and the demonstration of a 
certain number of their properties which were 
used in §C of chapter VIII. 


Brin: permits the extension of the formalism 
of chapter VIII to collisions with absorption and 
establishes the “ optical theorem”, A phenomeno- 
logical point of view is used whose principle 
is analogous to that of complement K,,,. Not 
difficult if chapter VIII has been well assimilated. 


Cym : illustration of the results of chapter VIII 
by several specific examples. Section 1 is recom- 
mended for a first reading, since it presents 
important physical results in a simple manner 
(Rutherford's formula). Section 2 can be con- 
sidered as a worked example. Section 3 gives 
problems without their solutions. 
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Complement Aym 


THE FREE PARTICLE: 


STATIONARY STATES 
WITH WELL-DEFINED ANGULAR MOMENTUM 


l. The radial equation 

2. Free spherical waves 
a. Recurrence relations 
b. Calculation of free spherical waves 
c. Properties 


3. Relation between free spherical waves and plane waves 


We introduced, in $C-2 of chapter VIII, two distinct bases of stationary states 
of a free (spinless) particle whose Hamiltonian is written: 


p? 


(1) 


The first of these bases is composed of the eigenstates common to H, and the three 
components of the momentum P ; the associated wave functions are the plane waves. 
The second consists of the stationary states with well-defined angular momentum, 
that is, the eigenstates common to Ho, L? and L,, whose principal properties we 
pointed out in $$ C-2-b, c and d of chapter VIII. We intend to study here this second 
basis in more detail. In particular, we wish to derive a certain number of results 
used in chapter VIII. 


1. The radial equation 


The Hamiltonian (1) commutes with the three components of the orbital 
angular momentum L of the particle: 


[Ho L] =0 (2) 


Consequently, we can apply the general theory developped in $A of chapter VII 
to this particular problem. We therefore know that the free spherical waves 
(eigenfunctions common to Ho, L? and L.) are necessarily of the form: 


Petal) = REI) Y7. o) (3) 


(0) 
xl 


The radial function R, (r) is a solution of the equation : 


h? 1 d? Il + Dh? 
|- CLA ADE koa E Ri (4) 
2u r dr 2ur 
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0) 
Em 


where E, , is the eigenvalue of Hy corresponding to ¢°),,(r). If we set: 


l 
ROI) = = ui) (5) 


(0) 
K.i 


d (1 +1) i 
dr? r? h 


the function 1°) is given by the equation: 


QueE,, 
e Ju) = 0 6 


to which we must add the condition : 
u0) = 0 ds 


It can be shown, first of all, that equations (6) and (7) enable us to find the 
spectrum of the Hamiltonian A,, which we already know from the study of the 
plane waves [formula (C-5) of chapter VIII]. To do this, note that the minimum 
value of the potential (which is, in fact, identically zero) is zero and that consequently 
there cannot exist a stationary state with negative energy (cf. complement M,,,). 
Consider, therefore, any positive value of the constant E, , appearing in equation (6), 
and set: 


k = VE, (8) 


As r approaches infinity, the centrifugal term /(/ + 1)/r* becomes negligible 
compared to the constant term of equation (6), which can thus be approximated by: 


5 rejeo, 0 (9) 


Consequently, all solutions of equation (6) have an asymptotic behavior (linear 
combination of e**” and e” *") which is physically acceptable. Therefore, the only 
restriction comes from condition (7): we know (cf. chap. VII, §A-3-b) that there 
exists, for a given value of E, ,, one and only one function (to within a constant fac- 
tor) which satisfies (6) and (7). For any positive E, ,, the radial equation (6) thus has 
one and only one acceptable solution. 

Thus, the spectrum of H, indeed includes all positive energies. Moreover, 
we see that the set of possible values of E, , does not depend on /; we shall therefore 
omit the index / for the energies. As for the index x. we shall identify it with the 
constant defined in (8); this allows us to write: 


a-= o k 20 (10) 


Each of these energies is infinitely degenerate. Indeed, for fixed k. there 
exists an acceptable solution u{°)(r) of the radial equation corresponding to the 


energy E, for every value (positive integral or zero) of /. Moreover, formula (3) 
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associates (2/ + 1) independent wave functions ¢°/,,(r) with a given radial 


function u{°)(r). Thus, we again find in this particular case the general result 
demonstrated in §A-3-b of chapter VII : Hy, L? and L, form a C.S.C.O. in @,, 
and the specification of the three indices k, / and m gives sufficient information 
for the determination of a unique function in the corresponding basis. 


2. Free spherical waves 

The radial functions R{)(r) = z u40)(r) can be found by solving equation (6) 
or equation (4) directly. The latter is easily reduced (comment of §2-c-B below) 
to a differential equation known as the “spherical Bessel equation” whose 
solutions are well-known. Instead of using these results directly, we are going to see 


how the various eigenfunctions common to H,, L? and L, can be simply deduced 
from those which correspond to the eigenvalue 0 of L?. 


a. RECURRENCE RELATIONS 

Let us define the operator: 

Py = Py + iP, (11) 
in terms of the components P, and P, of the momentum P. We know that P is a 
vectorial observable (cf. complement By,, §5-c), which implies the following 


commutation relations* between its components and those of the angular momen- 
tum L: 


[£,,P,] = 90 
[L,,P,] = inP, (12) 
[L.,P,] = — ihP, 


and the equations which are deduced from these by cyclic permutation of the 
indices x, y, z. Using these relations, a simple algebraic calculation gives the 
commutators of L, and L? with the operator P, ; we find: 


[EP] Snr. (13-a) 
[L?, P,] = 2A(P LL, — P,L,) + 2h?P, (13-b) 


Consider therefore any eigenfunction ¢,°),,(r) common to Hy, L? and L,, the 
corresponding eigenvalues being E, , /(/ + 1)h? and mh. By applying the operators L , 
and L_, we can obtain the 2/ other eigenfunctions associated with the same energy E, 


and the same value of /. Since Hy commutes with L, we have, for example: 


H,¿L, A) = L,H, PE? all) = E,L, Pe lt) (14) 


* These relations can be obtained directly from the definition L = R x P and the canonical 
commutation rules. 
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and L, ø {°} mlr) (which is not zero if m is different from /) is an eigenfunction of H, 
with the same eigenvalue as ¢;°),,(r). Therefore: 


Ls Ora mit) © Pol ma 1 (1) (15) 


Let us now allow P, to act on ¢,°),,(r). First of all, since Hy commutes 


with P, we can repeat the preceding argument for P, p;%,,. Moreover, from rela- 
tion (13-a): 


LP, OR) mlr) P L, Plm +hP, POl m 


(m + 1h P, Pelt) (16) 


P pff} m is therefore an eigenfunction of L, with the eigenvalue (m + 1)h. If we 
use equation (13-b) in the same way, we see that the presence of the term P.L, 
implies that P,{°),, is not, in general, an eigenfunction of L?; nevertheless, 
ifm = I, the contribution of this term is zero: 


L?P, gi? P,L? Pert ES 2hP,L, Per + 2h*P, PO 


KL 


[i +1) +21 +2]4%P, gf, 
= (1 + 1) +2)4P, of, (17) 


Consequently, P, {f}, is a common eigenfunction of Hy, L? and L, with the 
eigenvalues E,, (/ + 1)(/ + 2)h? and (/ + 1)h respectively. Since these three obser- 
vables form a C.S.C.O. (§ 1), there exists only one eigenfunction (to within a constant 
factor*) associated with this set of eigenvalues : 


P, pert) oC Pers 1+ i(r) (18) 


We are going to use the recurrence relations (15) and (18) to construct the 
Lo), (1) ) basis from the functions °} ¿(r) corresponding to zero eigenvalues 


for L? and L,**. 


b. CALCULATION OF FREE SPHERICAL WAVES 


a. Solution of the radial equation for 1 = 0 


In order to determine the functions ¿“Y g(r), we return to the radial equa- 


tion (6), in which we set / = 0; taking definition (10) into account, this equation 
can therefore be written: 
d? 


E + e uo =0 (19) 


* Later (§2-b), we shall specify the coefficients which insure the orthonormalization of the 
{ of) mlr) } basis (in the extended sense, since k is a continuous index). 


** It must not be thought that the operator P_ = P, — iP, allows one to “step down” from 


x 


an arbitrary value of / to zero. It can easily be shown, by an argument analogous to the preceding one, 
that: 


P_ oy? (r) x Pers wn lr) 
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The solution which goes to zero at the origin [condition (7)] is of the form: 
u(r) = a, sin kr (20) 


We choose the constant a, such that the functions pù} 9(r) are orthonormal in the 
extended sense; that is: 


[Er sto ote = atk = K) 21) 
It is easy to show (see below) that condition (21) is satisfied if: 
E 
a, = Jn (22) 


which yields (Y$ being equal to 1/41): 


pr l1 sin kr 
Pe. o(r) = 
V r ydr 4n kr 


Let us verify that the functions (23) satisfy the orthonormalization relation (21). To do 
this. it is sufficient to calculate : 


2 1’ sin kr sin k'r 
d? (O)% (0) a= kk’ 2 d a 
| reo Pro.ol") n 4r | Poe k- kr [uo 


(23) 


2 $ 
=— | dr sin kr sin k'r (24) 
T 
Oo 


Replacing the sines by complex exponentials and extending the interval of integration over the 
range — æ% to + 90, we obtain: 


s 2 1 tos i , : f 
E | dr sin kr sin k'r = — (- 5) | di [et tE ger] (25) 
T T 4 


0 — 4s 


Since k and k' are both positive, k + k' is always different from zero and the contribution of the 
first term within the brackets is always zero. According to formula (34) of appendix II. the 
second term yields finally: 


[er ose) of of) = (| 20 atk 1) 


= dk — k’) (26) 


B. Construction of the other waves by recurrence 


Let us now apply the operator P, defined in (11) to the function p¿% ¿(r) 
that we have just found. According to relation (18): 


PA (AP, OL lr) 
sin kr 
= (27) 
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In the { |r > } representation, which we have been using throughout, P, is the 
differential operator: 


hfe 0 
pl ti) (28) 


In formula (27), it acts on a function of r alone. Now: 
l h/x vid , 
A A E fer 
Mr) i (; r) dr MH) 


h. w d 
= —$ eae Tr 
z Sin Ve dr T(r) (29) 


Thus we obtain: 


cos kr sin kr 
Z a (30) 


o (r) x sin 0 al 
ORG a 


We recognize the angular dependence of Y! (0, y) [complement A,,, formula (32)]; 
by applying L_, ¢{?! o(r) and {°} _ ,(r) can be calculated. 

Although ¢,"') ,(r) depends on 0 and ¢, the application of P, to this function 
remains very simple. The canonical commutation relations indicate immediately 


that: 
[P.X + iY] =0 (31) 


Consequently, 9°) ,(r) is given by: 
, Sin kr 
pe} ale) XK P; kr 
x +iy d sin kr 
r de kr 
Id sin kr 


(a+ imp. -S 
o rdr kr 


x(x + ala z] (32) 


Py 


rdr| rdr kr 


In general: 


_ (1d Y sin kr 
per lr) x (x + iy)! E 5) re (33) 


The angular dependence of ¢{°, is contained in the factor: 


(x + iv) = r'(sin 0) ef? (34) 


which is indeed proportional to Y'¡(0, q). 
Let us define: 


L d ) sin p (35) 


y — 1. iol LES, 
id) = (— Wo! (Oa) y 
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j, thus defined, is the spherical Bessel function of order |. The preceding calculation 
shows that {°} ,(r) is proportional to the product of Y;(0, p) and j,(kr). We shall 


write (see below the problem of normalization): 


ROX) = j ji(kr) (36) 


The free spherical waves are then written: 


PO) w(t) = ar He) Yr(9, p) (37) 


They satisfy the orthonormalization relation: 
| # ©2280) Penile) = A — K) de Ban (38) 
and the closure relation: 


[ dk YY galt) AA) = dle — Y) (39) 


1=0 m=-1 


Let us now examine the normalization of the functions (37). To do so, let us begin by 
specifying the proportionality factors of the recurrence relations (15) and (18). For the first 
relation, we already know this factor from the properties of spherical harmonics (cf. comple- 
ment Ay): 


LP (e) = AVL + 1) — mm E 1) 9 na l) (40) 


As for relation (18), it can easily be shown, using the explicit expression for Y ;(@, @) [formulas (4) 
and (14) of complement A,,], equations (31) and (29) and definition (35), that, if we take (37) 
into account, this relation can be written: 


hk pu +2 
Pole r) = F DE 7 +3 Otis ¡+ 1) (41) 


In the orthonormalization relation (38), the factors 6,;-6,.,° on the right-hand side arise 
from the angular integration and the orthonormality of the spherical harmonics. To establish 
relation (38), it is thus sufficient to show that the integral : 


I{k. k) = f d?r olr) p9 (0) (42) 


is equal to 6(k — k’), We already know from (26) that J,(k, k’) has this value. Consequently, 
we shall demonstrate that, if : 


I(k,k') = d(k — k’) (43) 
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then the same is true for 1,,,(k, k”). Relation (41) permits us to write J,,,(k, k') in the form : 


no 1 UA+3f, (0) (0) 
is — [er Pose LP pinl) (r)] 


_ 12143 
hk’ 21 +2 


| dr git) PP, op le) (44) 


where P_ = P, — iP, is the adjoint of P,. Now: 


PP, =P + P; =P? — p? (45) 


pí””, is an eigenfunction of P?. Since, in addition, P, is Hermitian, it results that : 


o L eee 
lak) = {14 2104) — [ar [Poon [P02 } (46 


We must now calculate P ol% ,(r). Using the fact that Y{(9, p) is proportional to (x + iy)'/r', 
we easily find: 
E, )2 


2k 
P Pert r) = = nce cos 0 Yi(0, o) Jia (kr) 


= — —— Pf le) (47) 


according to formula (35) of complement A,,. Putting this result into (46), we finally obtain: 


21+3K (k, k’) (48) 


] 1 
MaS a A aa ee 


Hypothesis (43) thus implies : 
Tha ¡(k,k') = òlk — k’) (49) 


which concludes the argument by recurrence. 


c. PROPERTIES 


a. Behavior at the origin 


When p approaches zero, the function j,(p) behaves (see below) like: 


: p 
i?) zo OTF NN (50) 
Consequently, {°),,() is proportional to r‘ in the neighborhood of the origin: 


Pk? om (kr) 
Prim) Zo ja © YT(8, Vor + DN (51) 
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To demonstrate formula (50), starting from definition (35), it is sufficient to expand 
sin p/p in a power series in p: 


x x 2p 
sin p p 
= — 1)?_—-—_ (52) 
A aan 
Lay 
We then apply the operator a) which yields : 
p dp 
Lay” 2 2p 
j = — 1) == = 2p- 1-1 
M0 eS) Er 
s p2p — 2X2p — 4) .. [2p — Al — 1N] 
= (Y A y pp? (53) 
2, Cp +1)! 
The first / terms of the sum (p = 0 to / — 1) are zero, and the (/ + 1)th ts written: 
21(21 — 2)(21 — 4)... 2 
0) xy o e OA (54) 


p>0 (27 +1)! 
which proves (50). 


B. Asymptotic behavior 


When their argument approaches infinity, the spherical Bessel functions are 
related to the trigonometric functions in the following way: 


TT 


, 1. 
ile), x, 58i0(0 — 15) (55) 


The asymptotic behavior of the free spherical waves is therefore: 


2k? sin (kr — In/2 E 
to, Er E (56) 
aly rox y T kr 
ld sin p a ; 
If we apply the operator EF once to , we can write ¡¡(p) in the form: 
p ap p 
1d\ '[cosp sinp 
ile) = (- ve (24) | E ze] (57) 
p dp p? p’ 


The second term inside the brackets is negligible compared to the fìrst term when p approaches 


es ld ] : A 
infinity. Moreover, when we apply > a second time, the dominant term still comes from the 
pap 


derivative of the cosine. Thus we see that: 


11 E 
ile) x, (= NWo'= = (>) sin p (58) 
p' p \dp 
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Since: 


(Es p =(- 1) sin (> Z 15) (59) 


the result is indeed (55). 


COMMENT: 
If we set: 
kr =p (60) 
[k being defined by formula (10)], the radial equation (4) becomes: 
h(a o 


This is the spherical Bessel equation of order /. It has two linearly independent 
solutions, which can be distinguished, for example, by their behavior at the 
origin. One of them is then the spherical Bessel function ¡,(p), which 
satisfies (50) and (55). For the other, we can choose the “spherical Neumann 
function of order /”, designated as n,(p), with the properties: 


21 — 1)!! 
nl (62-a) 
p 
1 T 
O cos | p — 15 (62-b) 
3. Relation between free spherical waves and plane waves 


We already know two distinct bases of eigenstates of H,: the plane waves v¿“(r) 
are eigenfunctions of the three components of the momentum P ; the free spherical 
waves pc), (r) are eigenfunctions of L? and L,. These two bases are different 
because P does not commute with L? and L,. 

A given function of one of these bases can obviously be expanded in terms 
of the other basis. For example, we shall express a plane wave v{®’(r) as a linear super- 
position of free spherical waves. Consider, therefore, a vector k in ordinary space. 
The plane wave vý (r) that it characterizes is an eigenfunction of H, with the 
eigenvalue h?k?/2u. Therefore, its expansion will include only the 9 (°),, which 
correspond to this energy, that is those for which: 


k = |k| (63) 


This expansion will therefore be of the form: 


+1 


HR) = Y Y clk) P(t) (64) 
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the free indices k and k being related by equation (63). It is easy to show, using 
the properties of the spherical harmonics (cf. complement A,,,) and the spherical 
Bessel functions, that: 


20 +1 


era dt Y Y i YPO P) jkr) YP. 9) (65) 


t=O m=-l! 


where 6, and q, are the polar angles that fix the direction of the vector k. If k is 
directed along Oz, expansion (65) reduces to: 
ez = Y Y 4m(21 +1) jo(kr) Y9(0) 
1=0 


Y ¡(21 + 1) (kr) P,(cos 0) (66) 


t=0 


where P, is the Legendre polynomial of degree / [cf. equation (57) of comple- 
ment A,, |. 


Let us first demonstrate relation (66). To do this, let us assume that the vector k chosen 
is collinear with Oz : 


k, =k, =0 (67) 
and points in the same direction. In this case, equation (63) becomes: 

k, =k (68) 
and we want to expand the function: 

gikz = eikrcoxo (69) 


in the { p(0),,(r) } basis. Since this function is independent of the angle g, it is a linear combi- 
nation of only those basis functions for which m = 0: 


x 


elkr cose = X a, eer (1) 
1=0 
= Y c flkr) YPO) (70) 


1=0 


ikr cos V 


To calculate the numbers «,, we can consider e to be a function of the variable 0, 
with r playing the role of a parameter. Since the spherical harmonics form an orthonormal basis 
for functions of 0 and ø, the “coefficient” c,j,(kr) can be expressed as: 


cı kr) = | dQ ¥PH(0) err? (71) 


Replacing Y? by its expression in terms of Y!(0. @) [formula (25) of complement Ay,], we 
obtain: 


c i(kr) = Tz E 6 ¥(0, o)| ikr cose 


1 | EN 
= —— da Y!*(0, ) E gir eos | (72) 
Jen! MAY 
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since L, is the adjoint operator of L_. Formula (16) of complement Ay, then yields: 


li 


L 
LX ara l : d 
( p = (— 1) eile (sin oy glkrcos 8 
h d(cos 0)! 


= (—- 1) ello (sin 0)! (ikr)! gikr cos O (73) 


Now (sin 0) e"? is just, to within a constant factor, Y (0, p) [cf. formulas (4) and (14) of 
complement A,,]. Consequently : 


21 4 l 
c, jkr) = (ir! = | | da |¥ (0, p)? eo? (74) 
Jony GF + 0! 


It is therefore sufficient to choose a particular value of kr, for which we know the value 
of j,(Ar), in order to calculate c,. Allow, for example, kr to approach zero: we know that /,(kr) 
behaves like (kr), and so, in fact, does the right-hand side of equation (74). More precisely, 
using (50), we find: 


ELA dQ |Y40. p)? (75) 
C e $ A, a a ` 
ED Jop VEF! aa 
that is, since Yj is normalized to 1: 
e, = ¿421 + 1) (76) 


This proves formula (66). 

The general relation (65) can therefore be obtained as a consequence of the addition 
theorem for spherical harmonics [formula (70) of complement A,,]. Whatever the direction 
of k (defined by the polar angles 0, and ọ,), it is always possible, through a rotation of the 
system of axes, to return to the case we have just considered. Consequently. expansion (66) 
remains valid, providing that kz is replaced by k.r and cos 0 by cosa, where a is the angle 
between k and r: 


s 


et" = Y i (21 + 1)ji(kr) P,(cos a) K 


l=0 


But the addition theorem for spherical harmonics permits the expression of P (cosa) in terms 
of the angles (0, p) and (0,. @,), which yields finally formula (65). 


The expansions (65) and (66) show that a state of well-defined linear momen- 
tum involves all possible orbital angular momenta. 

To obtain the expansion of a given function p¿%,,(r) in terms of plane waves, 
it is sufficient to invert formula (65). using the orthonormalization relation of 
spherical harmonics which are functions of 0, and y,. This yields: 


| dQ, YPO, 9,) eo" = Ani! (kr) YTO, p) (78) 
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Thus we find : 


OL) (E) == ees dQ, YT(A,, Pp) eX" (79) 


An eigenfunction of L? and L, is therefore a linear superposition of all plane waves 
with the same energy : a state of well-defined angular momentum involves all possible 
directions of the linear momentum. 


References: 


Messiah (1.17), App. B, §6; Arfken (10.4), §11.6; Butkov (10.8), chap. 9, §9; see 
the subsection “Special functions and tables” of section 10 of the bibliography. 


950 


www.elsolucionario.net 


COLLISIONS WITH ABSORPTION 


Complement By: 


PHENOMENOLOGICAL DESCRIPTION 
OF COLLISIONS WITH ABSORPTION 


1. Principle involved 
2. Calculation of the cross sections 


a. Elastic scattering cross section 
b. Absorption cross section 
c. Total cross section. Optical theorem 


In chapter VIII, we confined ourselves to the study of the elastic* scattering of particles 
by a potential. But we pointed out in the introduction of chap. VIII that collisions 
between particles can be inelastic and lead, under certain conditions, to numerous 
other reactions (creation or destruction of particles, etc...), particularly if the energy 
of the incident particles is high. When such reactions are possible, and one detects 
only elastically scattered particles, one observes that certain particles of the incident 
beam “disappear” ; that is, they are not to be found either in the transmitted beam 
or amongst the elastically scattered particles. These particles are said to be “absor- 
bed” during the interaction; in reality, they have taken part in reactions other 
than that of simple elastic scattering. If one is interested only in the elastic scattering, 
one seeks to describe the “absorption” globally, without going into detail about 
the other possible reactions. We are going to show here that the method of partial 
waves provides a convenient framework for such a phenomenological description. 


1. Principle involved 


We shall assume that the interactions responsible for the disappearance of the 
incident particles are invariant with respect to rotation about O. The scattering 
amplitude can therefore always be decomposed into partial waves, each of which 
corresponds to a fixed value of the angular momentum. 

In this section, we shall see how the method of partial waves can be modi- 
fied to take a possible absorption into consideration. To do this, let us 
return to the interpretation of partial waves that we gave in $ C-3-b-a of chapter VIII. 
A free incoming wave penetrates the zone of influence of the potential and gives 
rise to an outgoing wave. The effect of the potential is to multiply this outgoing 
wave by e*'. Since the modulus of this factor is 1 (the phase shift ô, is real), 
the amplitude of the outgoing wave is equal to that of the incoming wave. 
Consequently (see the calculation of §2-b below), the total flux of the incoming 
wave is equal to that of the outgoing wave: during the scattering, probability is 
conserved, that is, the total number of particles is constant. These considerations 
suggest that, in the cases where absorption phenomena occur, one can take them 
into account simply by giving the phase shift an imaginary part such that: 


le?*4,| <] (1) 


* A collision is called elastic if it changes neither the nature nor the internal state of the concerned 
particles; otherwise it is called inelastic. 
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The amplitude of the outgoing wave with angular momentum / is thus smaller 
than that of the incoming wave from which it arises. The fact that the outgoing 
probability flux is smaller than the incoming flux expresses the “disappearance” 
of a certain number of particles. 

We are going to make this idea more explicit and deduce from it the 
expressions for the scattering and absorption cross sections. However, we stress 
the fact that this is a purely phenomenological method: the parameters with which 
we shall characterize the absorption (modulus of e?*? for each partial wave) mask 
an often very complicated reality. Note also that if the total probability is no longer 
conserved it is impossible to describe the interaction by a simple potential. A correct 
treatment of the set of phenomena which can then arise during the collision 
would demand a more elaborate formalism than the one developed in chapter VIII. 


2. Calculation of the cross sections 


We return to the calculations of §C-4 of chapter VIII, setting: 
ny = 07 (2) 
Since the possibility of producing reactions other than that of elastic scattering 


is always expressed by a decrease in the number of elastically scattered particles, 
we must have: 


In < 1 (3) 
(equality corresponding to cases where only elastic scattering is possible). The 


asymptotic form of the wave function which describes the elastic scattering is 

therefore [ef. formula (C-51) of chapter VIII]: 

ao ela on nh a e"? (4) 
2ikr 


r> 


r) ~ — Y ian + 1) YAO) 


a. ELASTIC SCATTERING CROSS SECTION 


The argument of $C-4-a of chapter VIII remains valid and gives the scattering 
amplitude /,(0) in the form: 


140) = oS Vamal + 1) reo 12 (5) 


From this we deduce the differential elastic scattering cross section: 


5. (0) => > J 4n(21 + 1) Y?(0) "i = : (6) 


and the total elastic scattering cross section: 


T 


k? 


Ge = 


Yt yi np (7) 


952 


www.elsolucionario.net 


COLLISIONS WITH ABSORPTION 


COMMENT: 


According to the argument developed in $ 1, the absorption of the 
wave (/) reaches a maximum when |y,| is zero, that is, when: 


n, = 09 (8) 


Formula (7) indicates however that, even in this limiting case, the contribution 
of the wave (/) to the elastic scattering cross section is not zero*, In other 
words, even if the interaction region is perfectly absorbing, it produces elastic 
scattering. This important phenomenon is a purely quantum effect. It can be 
compared to the behavior of a light wave which strikes an absorbing medium. 
Even if the absorption is total (perfectly black sphere or disc), a diffracted 
wave is observed (concentrated into a solid angle which becomes smaller 
as the surface of the disc becomes larger). Elastic scattering produced by a 
totally absorbing interaction is called, for this reason, shadow scattering. 


b. ABSORPTION CROSS SECTION 


Following the same principle as in $A-3 of chapter VIII, we define the 
absorption cross section g,,.: it is the ratio between the number of particles 
absorbed per unit time and the incident flux. 

To calculate this cross section, it is sufficient, as in §B-2 of chapter VIII, 
to evaluate the total amount of probability AP which “disappears” per unit time. 
This probability can be obtained from the current J associated with the wave 
function (4). 44 is equal to the difference between the flux of the incoming waves 
across a sphere (S) of very large radius R, and that of the outgoing waves; it is 
therefore equal to minus the net flux of the vector J leaving this sphere. Thus: 


AP = — | J.dS (9) 
(S) 
with: 
= aito Ey fait 
J = Re| 2% rad: Ar) (10) 
Only the radial component J, of the current contributes to the integral (9): 
49 = - | J,r? dQ (11) 
r=Ro 
with: 
J, = Re | véif*(r) ae vd ifr) (12) 
$ k in or * 
* This contribution is zero only if y, = 1, that is, if the phase shift is real and equal to an integral 


multiple of z [this was already contained in formula (C-58) of chapter VIH]. 
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In formula (12), the differentiation does not modify the angular dependence 
of the various terms which compose v{*'™(r) [formula (4)]. Consequently, because 
of the orthogonality of the spherical harmonics, the cross terms between 
a partial wave (/) in v{#f{r) and a different wave (/') in v(W*(r) make a zero 
contribution to integral (11). We have therefore: 


4P = — | JO r? dQ (13) 
1=0 r=Ro 


where J,” is the radial component of the current associated with the partial wave (/). 
A simple calculation gives: 


hk r(21 +1 
1,2, a Bee (19 


that is, finally, since Y? (0) is normalized: 
so = EZ $ (21 +10) [1 P] (15) 


k? 1=0 


The absorption cross section a, is therefore equal to the probability 47 


divided by the incident current hk/p: 
2 (22 +1 [1 — (m7 (16) 


T 


k2 


O, 


abs 


It is obvious that cp, is zero if all the 7, have a modulus of 1; that is, according 
to (2), if all the phase shifts are real. In this case, there is only elastic scattering, 
and the net flux of probability leaving a sphere of large radius Rọ is always zero. 
The total probability carried by the incoming waves is entirely transferred to the 
outgoing waves. On the other hand, when y, is zero, the contribution of the wave (/) 
to the absorption cross section is maximum. 


COMMENT: 
: , hk 
The calculation of expression (15) shows that == (21 + 1) is the amount of 


probability entering per unit time, and arising from the partial wave (/). Il we divide this 
quantity by the incident current hk/y, we obtain a surface that can be called the “incoming 
cross section into the partial wave (/)” : 


= (21 + 1) (17) 
k2 


This formula can be interpreted classically. We can consider the incident plane wave as 
describing a beam of particles of uniform density, having a momentum hk parallel to Oz. 
What proportion of these particles reach the scattering potential, with an angular 


momentum fi Jf (I + 1)?We have already mentioned the link between angular momentum 
and the impact parameter in classical mechanics [cf. formula (C-23) of chapter VIII]: 


|F| = b |p| = k b (18) 
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All we must do. therefore, is to draw, in the plane passing through O and perpendicular 
to Oz, a circular ring centered at O, of average radius 5, such that: 


AN + 1) = hk b, (19) 


and of width 4b, corresponding to A/ = 1 in formula (19) (fig. 1). All the particles crossing 


this surface reach the scattering potential with an angular momentum equal to f v I(l + 1), 
to within h. From (19) we derive: 


1 re | 1 
=—J/il +1)x-(1+- 2 
b= ¿VUE +1) el 5) (20) 
if / > 1, and consequently: 
áb, = : 1 
l k (21) 
The area of the circular ring of figure 1 is therefore : 
27 b, Ab, = Za +1) (22) 


k 


Thus we find again, very simply, o,. 


FIGURE | 


The incident particles must reach the potential with the impact parameter b, to within 4b, for their 
classical angular momentum to be hy Kl + 1) to within A. 


955 


www.elsolucionario.net 


COMPLEMENT By 


c. TOTAL CROSS SECTION. OPTICAL THEOREM 

When a collision can give rise to several different reactions or scattering 
phenomena, the total cross section ¢,,, is defined as the sum of the cross sections 
(integrated over all the directions of space) corresponding to all these processes. 
The total cross section is thus the number of particles which, per unit time, partici- 
pate in one or another of the possible reactions, divided by the incident flux. 

If, as above, we treat globally all reactions other than elastic scattering, we 
have simply: 


Tiot = Gal + Fabs (23) 
Formulas (7) and (16) then give: 
or 
Gian = 757 > (21 + na — Ren) (24) 
k^ 120 
Now (1 — Ren,) is the real part of (1 — 4,), which appears in the elastic 


scattering amplitude [formula (5)]. Moreover, we know the value of Y?(0) forð = 0: 


(25) 


Lef. complement Ay, formulas (57) and (60)]. Consequently, if we calculate from (5) 
the imaginary part of the elastic scattering amplitude in the forward direction, we 
find: 


Le 1 — Re 
Im (0) =7 Y Ql + y == (26) 
t=0 
Comparing this expression to formula (24), we see that: 
4r 
Su = Im f0) (27) 


This relation between the total cross section and the imaginary part of the elastic 
scattermg amplitude in the forward direction is valid in a very general sense; it 
constitutes what is called the optical theorem. 


COMMENT: 


The optical theorem is obviously valid in the case of purely elastic 
scattering (Cabs = 0; Gi,= Ca) The fact that f(0) — that is, the wave 
scattered in the forward direction — is related to the total cross section could 
have been predicted from the discussion in §B-2-d of chapter VIII. It is the 
interference in the forward direction between the incident plane wave and the 
scattered wave which accounts for the attenuation of the transmitted beam, 
due to the scattering of particles in all directions of space. 


References and suggestions for further reading: 


Optical model: Valentin (16.1), $X-3. High energy proton-proton collisions : 
Amaldi (16.31). 
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SOME SIMPLE APPLICATIONS OF SCATTERING THEORY 


1. The Born approximation for a Yukawa potential 
a. Calculation of the scattering amplitude and cross section 
b. The infinite-range limit 


2. Low energy scattering by a hard sphere 
3. Exercises 


a. Scattering of the p wave by a hard sphere 
b. * Square spherical well” : bound states and scattering resonances 


There is no potential for which the scattering problem can be solved exactly* by a 
simple analytical calculation. Therefore, in the examples that we are going to 
discuss here, we shall content ourselves with using the approximations that we 
introduced in chapter VIII. 


1. The Born approximation for a Yukawa potential 


Let us consider a potential of the form: 


(1) 


where V) and a are real constants, with a positive. This potential is attractive 
or repulsive depending on whether Y, is negative or positive. The larger ||, the 
more intense the potential. Its range is characterized by the distance: 


(2) 


since, as figure | shows, V(r) is practically zero when r exceeds 2r, or 3rg. 

The potential (1) bears the name of Yukawa, who had the idea of associating 
it with nuclear forces, whose range is of the order of a fermi. To explain the origin 
of this potential, Yukawa was led to predict the existence of the z-meson, which 
was indeed later discovered. Notice that for « = 0 this potential becomes the 
Coulomb potential, which can thus be considered to be a Yukawa potential of 
infinite range. 


* Actually, we can rigorously treat the case of the Coulomb potential; however, as we pointed 
out in chapter VIII (§B-1), this necessitates a special method. 
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AV(r) 


FIGURE 1 


Yukawa potential and Coulomb potential. The presence of the term e ” causes the Yukawa 
potential to approach zero much more rapidly when r > rọ = 1/a (range of the potential). 


a. CALCULATION OF THE SCATTERING AMPLITUDE AND CROSS SECTION 


We assume that || is sufficiently small for the Born approximation 
(§ B-4 of chapter VIII) to be valid. According to formula (B-47) of chapter VIII, 
the scattering amplitude f{*(0, @) is then given by: 


1 2pV, ee 
(B) — _ + ZHYo | 33 ¡Kr 
FOO, o) = PT pa re > (3) 


where K is the momentum transferred in the direction (0, @) defined by rela- 
tion (B-42) of chapter VIII. 

Expression (3) involves the Fourier transform of the Yukawa potential. Since 
this potential depends only on the variable r, the angular integrations can easily 
be carried out (§2-e of ‘appendix I), putting the scattering amplitude into the 
form: 


A 
4x h? |K| 


FPO, p) = e 


Ñ . ga 
[ r dr sin |K|r 7 (4) 


After a simple calculation, we then find: 


_2uY, 1 
h? a? + jK)? 


SEO, p) = 
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Figure 6 of chapter VIII shows that: 
. 0 
|K| = 2k sin 5 (6) 


where k is the modulus of the incident wave vector and 6 is the scattering angle. 
The differential scattering cross section is therefore written, in the Born 
approximation : 


_ 4V2 1 


o 80 A ee 
(0) ht Ta? + 4k? sin? 9/2]? 


(7) 


It is independent of the azimuthal angle y, as could have been foreseen from 
the fact that the problem of scattering by a central potential is symmetrical with 
respect to rotation about the direction of the incident beam. On the other hand, 
it depends, for a given energy (that is, for fixed k), on the scattering angle: 
in particular, the cross section in the forward direction (0 = 0) is larger than the 
cross section in the backward direction (0 = z). Finally, o(6), for fixed 0, is a 
decreasing function of the energy. Notice, moreover, that the sign of VW is of no 
importance in the scattering problem, at least in the Born approximation. 

The total scattering cross section is easily obtained by integration: 


4u’ V? An 
© — [ag cmo) = 4H Vo an _ (8) 
7 | oa) ht (a? + 4k?) 


b. THE INFINITE-RANGE LIMIT 


We noted above that the Yukawa potential approaches a Coulomb potential 
when a tends towards zero. What happens, in this limiting case, to the formulas 
that we have just established ? 

To obtain the Coulomb interaction potential between two particles having 
charges of Z,q and Z,q (q being the charge of the electron), we write : 


a=0 
Y) = ZZ, e (9) 
with : 
2 
2_ 4 
a (10 


Formula (7) then gives: 


4u? ZiZje* 


0 


08) = 
h* 16 k* sint 5 


2 72 p4 
_ ZziZ3€ 


E 0 
E 
16E* sin > 


(11) 
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(k has been replaced by its value in terms of the energy). 

Expression (11) is indeed that of the Coulomb scattering cross section 
(Rutherford's formula). Of course, the way in which we have obtained it does not 
constitute a proof: the theory we have used is not applicable to the Coulomb 
potential. However, it is interesting to observe that the Born approximation for 
the Yukawa potential gives precisely Rutherford’s formula for the limiting situation 
where the range of the potential approaches infinity. 


COMMENT: 


The total scattering cross section for a Coulomb potential is infinite 
since the corresponding integral diverges for small values of 0 [expression (8) 
becomes infinite when a approaches zero]. This results from the infinite 
range of the Coulomb potential : even if the particle passes very far from the 
point O, it is affected by the potential. This suggests why the scattering cross 
section should be infinite. However, in reality, one never observes a rigorously 
pure Coulomb interaction over an infinite range. The potential created by a 
charged particle is always modified by the presence, in its more or less 
immediate neighborhood, of other particles of opposite charge (screening 
effect). 


2. Low energy scattering by a hard sphere 


Let us consider a central potential such that: 
Vir) =0 for r > Fo 
= 0 ‘for r<ro (12) 


In this case, we say that we are considering a “hard sphere” of radius rọ. We assume 
that the energy of the incident particle is sufficiently small for Arg to be much 
smaller than 1. We can then ($ C-3-b-B of chapter VIII and exercise 3-a below) 
neglect all the phase shifts except that of the s wave (/ = 0). The scattering 
amplitude /,(0) is written, under these conditions : 


1, : 
fO) = i eo sin d4(k) (13) 
(since Y} = 1// 4n); the differential cross section is isotropic : 
Ls 
o(0) = 11,0)? = PE sin? 5,(k) (14) 
so that the total cross section is simply equal to: 


4x. 
o= a sin? So(k) (15) 
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To calculate the phase shift 9¿(4), it is necessary to solve the radial equation 
corresponding to / = 0. This equation is written [ cf. formula (C-35) of chapter VIII]: 


E + e haot =0 for r>ro (16) 


which must be completed by the condition: 
Uy olo) = 0 (17) 


since the potential becomes infinite for + = ro. The solution u, y(+) of equations (16) 
and (17) is unique to within a constant factor: 


up olr) = C sin k(r — ro) for F=> Fo 
= 0) for r< fo (18) 


The phase shift 59 is, by definition, given by the asymptotic form of u, olr): 

Uy olr) 7 Sin (kr + do) (19) 
Thus, using solution (18), we find: 

do(k) = — kro (20) 


If we insert this value into expression (15) for the total cross section, we obtain: 


dr . 

o = = sin? kr, = 4nr2 (21) 
since by hypothesis kr, is much smaller than 1. Therefore, o is independent of the 
energy and equal to four times the apparent surface of the hard sphere seen by the 
particles of the incident beam. A calculation based on classical mechanics would 
give for the cross section the apparent surface zr : only the particles which bounce 
elastically off the hard sphere would be deflected. In quantum mechanics, however, 
one studies the evolution of the wave associated with the incident particles, and the 
abrupt variation of V(r) at f = rọ produces a phenomenon analogous to the 
diffraction of a light wave. 


COMMENT: 


Even when the wavelength of the incident particles becomes negligible 
compared to ro (Arg > 1), the quantum cross section does not approach zr2. 
It is possible, for very large A, to sum the series which gives the total cross 
section in terms of phase shifts [formula (C-58) of chapter VIII]; we then 
find: 


0, ~ nro (22) 
Wave effects thus persist in the limiting case of very small wavelengths. 
This is due to the fact that the potential under study is discontinuous 
at r = rọ} it always varies appreciably within an interval which is smaller 
than the wavelength of the particles (cf. chap. I, $D-2-a). 
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3. Exercises 


a. SCATTERING OF THE p WAVE BY A HARD SPHERE 


We wish to study the phase shift ô (k) produced by a hard sphere on 
the p wave (/ = 1). In particular, we want to verify that it becomes negligible 
compared to 6,(k) at low energy. 


a. Write the radial equation for the function u, ,(r) for r > ro. Show that 
its general solution is of the form: 


Uy (r) = de — Cos kr + do + sin to) | 


where C and a are constants. 


p. Show that the definition of 6,(k) implies that: 
a = tan 6,(k) 
y. Determine the constant a from the condition imposed on u, ,(r) at r = ro. 


ô. Show that, as k approaches zero, 6,(k) behaves like* (kr,)°, which makes 
it negligible compared to 6,(k). 


b. “SQUARE SPHERICAL WELL” : 
BOUND STATES AND SCATTERING RESONANCES 


Consider a central potential V(r) such that : 
V(r) = — V, for FS fo 
= 0 for F >to 


where V, is a positive constant. Set: 


k = [2u Vo 
o 


f 


VK 


We shall confine ourselves to the study of the s wave (/ = 0). 


a. Bound states (E < 0) 


(i) Write the radial equation in the two regions r > rọ and r < ro, as well 
as the condition at the origin. Show that, if one sets: 


[— 24E 
K = Vk — p° 


* This result is true in general : for any potential of finite range ro, the phase shift 5,(k) behaves 
like (kr,)?'*! at low energies. 
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the function u(r) is necessarily of the form: 
u(r) = Ae? for r >To 
= Bsin Kr for r< ro 


(ii) Write the matching conditions at r = rọ. Deduce from them that the only 
possible values for p are those which satisfy the equation : 


tan Kr, = — = 


p 


(iii) Discuss this equation: indicate the number of s bound states as a function 
of the depth of the well (for fixed rọ) and show, in particular, that there are no 
bound states if this depth is too small. 


B. Scattering resonances (E > 0) 


(i) Again write the radial equation, this time setting : 


K' = Vk +h 


Show that u, ¿(r) is of the form: 
= Bsin K'r for r< fo 
(ii) Choose A = 1. Show, using the continuity conditions at r = r,, that the 
constant B and the phase shift 6) are given by: 
B? = — e _ 
k? + k2 cos? K'r, 
with: 
k i 
tan a(k) = p tan K'ro 


(iii) Trace the curve representing B? as a function of k. This curve clearly 
shows resonances, for which B? is maximum. What are the values of k associated 
with these resonances? What is then the value of æ (k)? Show that, if there exists 
such a resonance for a small energy (kr, < 1), the corresponding contribution 
of the s wave to the total cross section is practically maximal. 
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y. Relation between bound states and scattering resonances 


; T i : 
Assume that koro is very close to (2n + 1) 5 where n is an integer, and set : 


T 


koro = (2n +1) 5 


+e with le] < 1 


(i) Show that, if « is positive, there exists a bound state whose binding 
energy E = — h*p?/2p is given by: 


p = eko 


(ii) Show that if, on the other hand, « is negative, there exists a scattering 
resonance at energy E = h?k?/2u such that: 


2k oe 


Fo 


k? o — 


(iii) Deduce from this that if the depth of the well is gradually decreased 
(for fixed rọ), the bound state which disappears when koro passes through an odd 
multiple of z/2 gives rise to a low energy scattering resonance. 


References and suggestions for further reading : 
Messiah (1.17), chap. IX, $10 and chap. X., $$ HI and IV; Valentin (16.1), Annexe H. 
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Until now, we have considered the electron to be a point particle possessing 
three degrees of freedom associated with its three coordinates x, y and z. 
Consequently, the quantum theory that we have developed is based on the hypothesis 
that an electron state, at a given time, is characterized by a wave function w(x, y, z) 
which depends only on x, y and z. Within this framework, we have studied a certain 
number of physical systems : amongst others, the hydrogen atom (in chapter VII), 
which is particularly interesting because of the very precise experiments that can be 
performed on it. The results obtained in chapter VII actually describe the emission 
and absorption spectra of hydrogen very accurately. They give the energy levels 
correctly and make it possible to explain, using the corresponding wave functions, 
the selection rules (which indicate which frequencies, out of all the Bohr frequencies 
which are a priori possible, appear in the spectrum). Atoms with many electrons 
can be treated in an analogous fashion (by using approximations, however, since 
the complexity of the Schródinger equation, even for the helium atom with two 
electrons, makes an exact analytic solution of the problem impossible). In this case 
as well, agreement between theory and experiment is satisfying. 

However, when atomic spectra are studied in detail, certain phenomena appear, 
as we shall see, which cannot be interpreted within the framework of the theory 
that we have developed. This result is not surprising. It is clear that it is necessary 
to complete the preceding theory by a certain number of re/ativistic corrections: 
one must take into account the modifications brought in by re/ativistic kinematics 
(variation of mass with velocity, etc.) and magnetic effects which we have neglected. 
We know that these corrections are small ($ C-4-a of chapter VII) : nevertheless, they 
do exist, and can be measured. 

The Dirac equation gives a relativistic quantum mechanical description of the 
electron. Compared to the Schródinger equation, it implies a profound modification 
in the quantum description of the properties of the electron; in addition to the 
corrections already pointed out concerning its position variables, a new characteristic 
of the electron appears: its spin. In a more general context, the structure of the 
Lorentz group (group of relativistic space-time transformations) reveals spin to 
be an intrinsic property of various particles, on the same footing, for example, as 
their rest mass*. 

Historically, electron spin was discovered experimentally before the intro- 
duction of the Dirac equation. Furthermore, Pauli developed a theory which allowed 


* This does not mean that spin has a purely relativistic origin: it can be deduced from the 
structure of the non-relativistic transformation group (the Galilean group). 
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spin to be incorporated simply into non-relativistic quantum mechanics* through 
the addition of several supplementary postulates. Theoretical predictions for the 
atomic spectra are then obtained which are in excellent agreement with experimental 
results **. 

It is Pauli’s theory, which is much simpler than Dirac's, that we are going to 
develop in this chapter. We shall begin, in §A, by describing a certain number of 
experimental results, which revealed the existence of electron spin. Then we shall 
specify the postulates on which Pauli’s theory is based. Afterwards, we shall examine, 
in $B, the special properties of an angular momentum 1/2. Finally, we shall show, 
in $C, how one can take into account simultaneously the position variables and the 
spin of a particle such as the electron. 


A. INTRODUCTION OF ELECTRON SPIN 


1. Experimental evidence 


Experimental demonstrations of the existence of electron spin are numerous 
and appear in various important physical phenomena. For example, the magnetic 
properties of numerous substances, particularly of ferromagnetic metals, can only 
be explained if spin is taken into account. Here, however, we are going to confine 
ourselves to a certain number of simple phenomena observed experimentally in 
atomic physics: the fine structure of spectral lines, the Zeeman effect and, finally, 
the behavior of silver atoms in the Stern-Gerlach experiment. 


a. FINE STRUCTURE OF SPECTRAL LINES 


The precise experimental study of atomic spectral lines (for the hydrogen 
atom, for example) reveals a fine structure: each line is in fact made up of several 
components having nearly identical frequencies*** but which can be clearly 
distinguished by a device with good resolution. This means that there exist groups 
of atomic levels which are very closely spaced but distinct. In particular, the 
calculations of $C of chapter VII give the average energies of different groups 
of levels for the hydrogen atom but do not explain the splittings within each 


group. 


* Pauli's theory can be obtained as a limiting case of Dirac's theory when the electron's speed 
is small compared to that of light. 


** We shall see, for example in chapter XII where the general perturbation theory treated 
in chapter XI is used, how relativistic corrections and the existence of spin enable us to account 
quantitatively for the details of the hydrogen atomic spectrum (which would be inexplicable if we 
limited ourselves to the theory of chapter VII). 


*** For example, the resonance line of the hydrogen atom (2p <=> Is transition) is actually 
double: the two components are separated by an interval of the order of 1074 eV (that is, about 10% times 
smaller than the average 2p <-> Is transition energy, which is equal to 10.2 eV). 
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b. « ANOMALOUS » ZEEMAN EFFECT 


When an atom is placed in a uniform magnetic field, each of its lines (that 
is, each component of the fine structure) splits into a certain number of equidistant 
lines, the interval being proportional to the magnetic field : this is the Zeeman effect. 
The origin of the Zeeman effect can be easily understood by using the results of 
chapters VI and VII (complement Dy). The theoretical explanation is based on 
the fact that a magnetic moment M is associated with the orbital angular 
momentum L of an electron: 


M = L (A-1) 


where uy is the “ Bohr magneton”: 


gh 
Ha = 2m, (A-2) 


However, while this theory is confirmed by experiment in certain cases (the so-called 
“normal” Zeeman effect), it is, in other cases, incapable of accounting quantitatively 
for the observed phenomena (the so-called “anomalous” Zeeman effect). The most 
striking “anomaly” appears for atoms with odd atomic number Z (in particular, 
for the hydrogen atom): their levels are divided into an even number of Zeeman 
sub-levels, while, according to the theory, this number should always be odd, being 
equal to (2/ + 1) with / an integer. 


c. EXISTENCE OF HALF-INTEGRAL ANGULAR MOMENTA 


We are confronted with the same difficulty in connection with the Stern- 
Gerlach experiment, which we described in chapter IV ($A-1); the beam of silver 
atoms is split symmetrically in two. These results suggest that kalf-integral values of j 
(which we saw in $C-2 of chapter VI to be a priori possible) do indeed exist. 
But this poses a serious problem, since we showed in $D-1-b of chapter VI that 
the orbital angular momentum of a particle such as an electron could only be integral 
(more precisely, it is the quantum number / which is integral). Even in atoms with 
several electrons, each of these has an integral orbital angular momentum, and 
we shall show in chapter X that, under these conditions, the total orbital angular 
momentum of the atom is necessarily integral. The existence of half-integral angular 
momenta thus cannot be explained without supplementary hypotheses. 


COMMENT: 


It is not possible to measure directly the angular momentum of the 
electron using the Stern-Gerlach apparatus. Unlike silver atoms, electrons 
possess an electric charge g, and the force due to the interaction between 
their magnetic moment and the inhomogeneous magnetic field would be 
completely masked by the Lorentz force qv x B. 
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2. Quantum description: postulates of the Pauli theory 


In order to resolve the preceding difficulties, Uhlenbeck and Goudsmit (1925) 
proposed the following hypothesis: the electron “spins” and this gives it an intrinsic 
angular momentum which is called the spin. To interpret the experimental results 
described above, one must also assume that a magnetic moment M, is associated 
with this angular momentum S* : 


M, = 2 c S (A-3) 
Note that the coefficient of proportionality between the angular momentum and 
the magnetic moment is twice as large in (A-3) as in (A-1): one says that the spin 
gyromagnetic ratio is twice the orbital gyromagnetic ratio. 

Pauli later stated this hypothesis more precisely and gave a quantum 
description of spin which is valid in the non-relativistic limit. To the general 
postulates of quantum mechanics that we set forth in chapter HI must be added 
a certain number of postulates relating to spin. 

Until now, we have studied the quantization of orbital variables. With the 
position r and the momentum p of a particle such as the electron, we associated 
the observables R and P acting in the state space &,, which is isomorphic to the 
space F of wave functions. All physical quantities are functions of the fundamental 
variables r and p, and the quantization rules enable us to associate with them 
observables acting in &,. We shall call &, the orbital state space. 

To these orbital variables we shall add spin variables which satisfy the following 
postulates : 


(i) The spin operator S is an angular momentum. This means ($ B-2 of chapter VI) 
that its three components are observables which satisfy the commutation relations: 


[S,, S,] = mS, (A-4) 
and the two formulas which are deduced by cyclic permutation of the indices x, y, z. 


(ii) The spin operators act in a new space, the “spin state space” & , where S? 
and S, constitute a C.S.C.O. The space &, is thus spanned by the set of eigen- 
states | s, m >) common to S? and S,: 


S? |s, m> = s(s + 1)h? | s, m> (A-5-a) 
S, |s,m > = mh|s,m) (A-5-b) 
According to the general theory of angular momentum (§C of chapter VI), we 
know that s must be integral or half-integral and that m takes on all values included 


between — sand + s which differ from these two numbers by an integer (which may 
be zero). A given particle is characterized by a unique value of s: this particle is 


* Actually, when one takes into account the coupling of the electron with the quantized 
electromagnetic field (quantum electrodynamics), one finds that the coefficient of proportionality 
between My and S is not eactly 2u,/h. The difference, which is of the order of 107° in relative value, 
is easily observable experimentally ; it is often called the “anomalous magnetic moment” of the electron. 


970 


www.elsolucionario.net 
A. INTRODUCTION OF ELECTRON SPIN 


said to have a spin s. The spin state space &, is therefore always of finite dimen- 
sion (2s + 1), and all spin states are eigenvectors of S* with the same eigenvalue 
s(s + 1)h?. 

(iii) The state space & of the particle being considered is the tensor product 
of €, and é,: 


6=6,@6, (A-6) 


Consequently (§F of chapter II), all spin observables commute with all orbital 
observables. 

Except for the particular case where s = 0, it is therefore not sufficient to 
specify a ket of &, (that is, a square-integrable wave function) to characterize a state 
of the particle. In other words, the observables X, Y and Z do not constitute 
a C.S.C.O. in the space state & of the particle (no more than do P,, P,, P, or any 
other C.S.C.O. of &,). It is also necessary to know the spin state of the particle, 
that is, to add to the C.S.C.O. of $, a C.S.C.O. of £, composed of spin observables, 
for example, S? and S, (or S? and S,). Every particle state is a linear combination of 
vectors which are tensor products of a ket of $, and a ket of &, (see §C below). 

(iv) The electron is a spin 1/2 particle (s = 1/2) and its intrinsic magnetic 
moment is given by formula (A-3). For the electron, the space &, is therefore two- 
dimensional. 


COMMENTS: 


(i) The proton and the neutron, which are nuclear constituents, are also 
spin 1/2 particles, but their gyromagnetic ratios are different from that of the 
electron. At the present time we know of the existence of particles of 
spin 0, 1/2, 1, 3/2, 2, ... up to higher values such as 11/2. 

(ii) In order to explain the existence of spin, we could imagine that a particle 
like the electron, instead of being a point, has a certain spatial extension. It 
would then be the rotation of the electron about its axis that would give rise 
to an ‘intrinsic angular momentum. However, it is important to note that, 
in order to describe a structure that is more complex than a material point, 
it would be necessary to introduce more than three position variables. If, 
for example, the electron behaved like a solid body, six variables would be 
required : three coordinates to locate one of its points chosen once and for 
all, such as its center of gravity, and three angles to specify its orientation 
in space. The theory that we are considering here is radically different. It 
continues to treat the electron like’a point (its position is fixed by three 
coordinates). The spin angular momentum is not derived from any position 
or momentum variable*. Spin thus has no classical analogue. 


* If it were, moreover, it would necessarily be integral. 
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B. SPECIAL PROPERTIES OF AN ANGULAR MOMENTUM 1/2 


We shall restrict ourselves from now on to the case of the electron, which is 
a spin 1/2 particle. From the preceding chapters, we know how to handle its orbital 
variables. We are now going to study in more detail its spin degrees of freedom. 

The spin state space &, is two-dimensional. We shall take as a basis the 
orthonormal system {| + >, | — >} of eigenkets common to S? and S, which 
satisfy the equations: 


S |+) =| +) (B-La) 
1 

AED ES (B-1-b) 

<+|->=0 (B-2-a) 

<+|+>=<-]|->=1 (B-2-b) 

Ip ocr) O |S (B-3) 


The most general spin state is described by an arbitrary vector of 6,: 
ee eae eS (B-4) 


where c, and c_ are complex numbers. According to (B-1-a), all the kets of &, 
are eigenvectors of S? with the same eigenvalue 37/4, which causes S? to be 
proportional to the identity operator of &@,: 


3 
a h? < 
S 7 (B-5) 
Since S is, by definition, an angular momentum, it possesses all the general 
properties derived in $C of chapter VI. The action of the operators: 


S, = S, tiS, (B-6) 


on the basis vectors | + > and | — > is given by the general formulas (C-50) of 
chapter VI when one sets j = s = 1/2: 


S,|+> =0 S.|-—> =h| +) (B-7-a) 
S.|+>=f|-> S_|-> =0 (B-7-b) 


Any operator acting in $, can be represented, in the {| + >, | — > } basis, by 
a 2 x 2 matrix. In particular, using (B-1-b) and (B-7), we find the matrices 
corresponding to S,, S, and S, in the form: 

h 


(S) =50 (B-8) 
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where 6 designates the set of the three Pauli matrices : 


Li 0 
a 0 1 pE (? ) E (; ) (B-9) 
1 0 i 0 0 —1 
The Pauli matrices possess the following properties, which can easily be 
verified from their explicit form (B-9) (see also complement Ay): 


o = 0 =o, =1 (B-10-a) 
0,0, +00, = 0 (B-10-b) 
[o,,0,] = 210, (B-10-c) 
0,0, = Io, (B-10-d) 


(to the last three formulas must be added those obtained through cyclic permu- 
tation of the x, y, z indices). It also follows from (B-9) that: 


Tro, = Tro, = Tro, = 0 (B-11-a) 
Det o, = Det o, = Deto, = — 1 (B-11-b) 


Furthermore, any 2 x 2 matrix can be written as a linear combination, with 
complex coefficients, of the three Pauli matrices and the unit matrix. This is simply 
due to the fact that a 2 x 2 matrix has only four elements. Finally, it is easy to 
derive (see complement A, y) the following identity : 


(5.A)J(c.B)=A.B+i6.(A x B) (B-12) 


where A and B are two arbitrary vectors, or two vector operators whose three 
components commute with those of the spin S. If A and B do not commute with 
each other, the identity remains valid if A and B appear in the same order on the 
right-hand side as on the left-hand side. 

The operators associated with electron spin have all the properties which 
follow directly from the general theory of angular momentúm. They have, in 
addition, some specific properties related to their particular value of s (that is, of j), 
which is the smallest one possible (aside from zero). These specific properties 
can be deduced directly from (B-8) and formulas (B-10): 


e 
St =? = SP = 4 (B-13-a) 
5,5, +5S,S, =0 (B-13-b) 
S,S, = Shs, (B-13-c) 
S = 52 =0 (B-13-d) 
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C. NON-RELATIVISTIC DESCRIPTION 
OF A SPIN 1/2 PARTICLE 


We now know how to describe separately the external (orbital) and the internal 
(spin) degrees of freedom of the electron. In this section, we are going to assemble 
these different concepts into one formalism. 


1. Observables and state vectors 


a. STATE SPACE 


When all its degrees of freedom are taken into account, the quantum state of 
an electron is characterized by a ket belonging to the space & which is the tensor 
product of &, and 6, ($ A-2). 

We extend into &, following the method described in $ F-2-b of chapter II, 
both the operators originally defined in &, and those which initially acted in &, 
(we shall continue to use the same notation for these extended operators as for the 
operators from which they are derived). We thus obtain a C.S.C.O. in € through 
the juxtaposition of a C.S.C.O. of £, and one of &,. For example, in &,, we can 
take S? and S, (or S? and any component of S). In &,, we can choose { X, Y, Z }, 
or { Pe Pp P, }, or, if H designates the Hamiltonian associated with a central 
potential, { H, L?, L, } etc. From this we deduce various C.S.C.O. in £: 


{ X, Y, Z, S?, S, } (C-1-a) 
{Py Py Py SS} (C-1-b) 
{ H, L?, L,, S?, S, } (C-1-c) 


etc. Since all kets of £ are eigenvectors of S? with the same eigenvalue [formula (B-5)], 
we can omit S? from the sets of observables. 

We are going to use here the first of these C.S.C.O., (C-1-a). We shall take 
as a basis of & the set of vectors obtained from the tensor product of the kets 
|r>=|x, y, z > of 8, and the kets |e > of £: 


ne» =|x,y,z6> = |r> 0 led (C-2) 


where the x, y, z, components of the vector r, can vary from — œ to + œ (conti- 
nuous indices), and e is equal to + or — (discrete index). By definition, | r, e > is an 
eigenvector common to X, Y, Z, S? and S, : 


X|re>=x|re> 
Y|r,e>=yl|re> 
Zlre>»=zl|r,8> 


s? 1,2) =P |1,0) 


S.|r,e) =e5[1,0) (C-3) 
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Each ket |r, e > is unique to within a constant factor, since X, Y, Z, S? and S, 
constitute a C.S.C.O. The { |r, e > } system is orthonormal (in the extended sense), 


since the sets {|r >} and {| +, | — >) are each orthonormal in $, and 6, 
respectively : 
(rye |r, e> = ôr, dr’ — r) (C-4) 


(6,,, is equal to 1 or O depending on whether z and s are the same or different). 
Finally, it satisfies a closure relation in &: 


y ferline» 0] = [Print cnt | + fær 


r, =><r,-|=1 
(C-5) 


b. {r e») REPRESENTATION 


o. State vectors 


Any state | y > of the space & can be expanded in the {| r, e > } basis. To do 
this, it suffices to use the closure relation (C-5): 


>= Lerner celo) (C-6) 


The vector | Y > can therefore be represented by the set of its coordinates in the 
{ |r, e > } basis, that is, by the numbers : 


(rely) = yt) (C-7) 


which depend on the three continuous indices x, y, z (or, more succinctly, r) and 
on the discrete index e (+ or —). In order to characterize completely the state of an 
electron, it is therefore necessary to specify two functions of the space variables x, y 
and z: 


wir) =< r, + y> 
y-t) =< r, — | y> (C-8) 


These two functions are often written in the form of a two-component spinor, 
which we shall write [y ](r): 


wiw = (49) C9 


The bra < y | associated with the ket | y > is given by the adjoint of (C-6): 


l= E [arco lner nel (C-10) 
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that is, taking (C-7) into account: 


<v | = E [ur vir) <r, e | (C-11) 


The bra < y | is thus represented by the two functions y* (r) and y* (r), which can 
be written in the form of a spinor which is the adjoint of (C-9): 


[y] (r) = (y2 (r) y* e) (C-12) 


With this notation, the scalar product of two state vectors | y > and | ø >, which, 
according to (C-5), is equal to: 


culos =Z [Pruner crol) 
= fort y (r) p, (1) + y (r) g_(r) ] (C-13) 


can be written in the form: 


(wigs | Pr [T'O [oll (C-14) 


This formula is very similar to the one which permitted the calculation of the scalar 
product of two kets of £, from the corresponding wave functions. However, it is 
important to note that here the matrix multiplication of the spinors [w]'(r) and 
[q |(r) must precede the spatial integration. In particular, the normalization of the 
vector | y > is expressed by: 


<vly> = fer [vw I(r) [y 10) = fort Wr)? + [y (117 ] = 1 (C-15) 


Amongst the vectors of $, some are the tensor products of a ket of 6, and 
a ket of 6, (this is the case, for example, for the basis vectors). If the state vector 
under consideration is of this type: 


Iw> =l) 0 |x> (C-16) 


with: 
lp >= [ero lr es, 


lx>=cec,| +> +e_|-deé (C-17) 


Ss 
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the spinor associated with it takes on the simple form: 


[y tr) = (es =) = on“) (C-18) 
g(r) e. bo 
This results from the definition of the scalar product in &, and we have in this case : 
Y, lr) =<r, +| Y> =<r|o><+ |z) = ele, (C-19-a) 
Y lr) =<r, = | y> =<r |p) <- | z> = ple) c- (C-19-b) 


The square of the norm of | y > is then given by: 


<vlw>=<ele><xlx> =(e.)? + je?) fær |e(r)|? (C-20) 
B. Operators 


Let | y” > be the ket obtained from the action of the linear operator A on the 
ket | y > of 6. According to the results of the preceding section, | y'> and | y > 
can be represented by the two-component spinors [y ](r) and [y](r). We are now 
going to show that one can associate with A a 2 x 2 matrix [4] such that: 


[y']e) = [4] [vJtr) (C-21) 


where the matrix elements remain in general differential operators with respect to 
the variable r. 

(i) Spin operators. These were initially defined in &,. Consequently, they act 
only on the £ index of the basis vectors |r, e >, and their matrix form is the one 
stated in $ B. We shall limit ourselves to one example, say that of the operator S,. 
Its action on a vector | y > expanded as in (C-6) gives a vector ly >: 


Y> =h [eww |r, +> (C-22) 


since S, annihilates all the |r, + > kets and transforms |r, — > into A |r, + >. 
The components of | y >in the { |r, ¢ > } basis are, according to (C-22) : 


)=h 
‘(r) = 0 (C-23) 


The spinor representing | y’ > is therefore: 


mw =at 0) (C-24) 


N 


This is indeed what is obtained if one performs the matrix multiplication of the 
spinor [y ](r) by: 


h O 1 
(sa) = 5 (4s + io,) = a(o 0) (C-25) 
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(ii) Orbital operators. Unlike the preceding operators, they always leave 
unchanged the e index of the basis vector | r, e >: their associated 2 x 2 matrices 
are always proportional to the unit matrix. On the other hand, they act on the 
r-dependence of the spinors just as they act on ordinary wave functions. Consider, 
for example, the kets | y" > = X |y > and |y” > = P, | y X. Their components in 
the { |r, e > } basis are, respectively: 


Wir) =< rn el X| y> = x y, (r) (C-26-a) 
Wie) = <n] Pyy =A Zu. (C-26-b) 


The spinors [y"](r) and [y”](r) are thus obtained from [y ](r) by means of the 
2 x 2 matrices: 


[x] = 6 °) (C-27-a) 
0 

[P.] =" un > (C-27-b) 
Oe 


(iii) Mixed operators. The most general operator acting in & is represented, 
in matrix notation, by a 2 x 2 matrix whose elements are differential operators 
with respect to the r variables. For example: 


ho 
lia ° 
LS] == C-28 
i 0p 
or: 
0 0 0 
h ñ? | az ôx oy 
[S . P] =>3(0,P, + oP, + o,P,) =a 0 7 0 (C-29) 
Jx ET DE 
COMMENTS: 


(i) The spinor representation { | r, e > } is analogous to the { | r > } representation 
of &. The matrix element < y | A | > of any operator A of & is given by 
the formula : 


Cul Alo>= i dèr [yT e) [4] Eole) (C-30) 


where [4 | designates the 2 x 2 matrix which represents the operator A (one 
first carries Out the matrix multiplications and then integrates over all space). 
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This representation will only be used when it simplifies the reasoning and 
the calculations: as in 6,, the vectors and operators themselves will be used 
as much as possible. 


Obviously, there also exists a { | p, £ > } representation, whose basis vectors 


are the eigenvectors common to the C.S.C.O. { Pa P,. P., S?, S, |. The 
definition of the scalar product in & vields : 


: l es 
<relpe>=<r]p><ele> = bape Cira (C-31) 


Inthe { | p,e > | representation, one associates with each vector | y > of é a two- 
component spinor : 


= Y (p) 

[v \(p) = Ñ ) (C-32) 
Y -(p) 

with 

WP) =< pP. +|w> 

Y-(p) =< p. -| Y> (C-33) 


According to (C-31), Y, (p) and W_(p) are the Fourier transforms of w, (r) 
and y _ (r): 


y 


P) =<re lv) = Y [drel cre wo 


= sai | d?r ips y (r) (C-34) 


The operators are still represented by 2 x 2 matrices, and those corresponding 
to the spin operators remain the same as in the { |r, ¢ > } representation. 


Probability calculations for a physical measurement 


Using the formalism we have just described, we can apply the postulates of 


chapter III to obtain predictions concerning the various measurements that one can 
imagine carrying out on an electron. We are going to give several examples. 


First of all. consider the probabilistic interpretation of the components y, (r) 


and w_(r) of the state vector |W >. which we assume to be normalized 
[formula (C-15)]. Imagine that we are simultaneously measuring the position of 
the electron and the component of its spin along Oz. Since X, Y, Z and $, constitute 
a C.S.C.O., there exists only one state vector which corresponds to a given 
result : x, y, z and + h/2. The probability ¿32(r, +) of the electron being found 
in the infinitesimal volume d?r around the point r(x, y, z} with its spin “up” (the 
component along Oz equal to + h/2) is equal to: 


Por. +) = |r + |y P dr = y, (0)? dr (C-35) 
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In the same way: 
ËP, -)=| <r, — | y> |? dr = |y t)? der (C-36) 


is the probability of the electron being found in the same volume as before but 
with its spin “down” (component along Oz equal to — h/2). 

If it is the component of the spin along Ox that is being measured at the 
same time as the position, all we need to do is use formulas (A-20) of chapter IV. 
The X, Y, Z and S, operators also form a C.S.C.O.: to the measurement result 
lx, y, 7, + h/2 } corresponds a single state vector: 


Ile RS (C-37) 


The probability of the electron being found in the volume d?r around the point r 
with 1ts spin in the positive direction of the Ox axis is then: 


dr x lr +Y +<5 yl] => Wal) +y per (038) 


Obviously, one can measure the momentum of the electron instead of its 
position. One then uses the components of |y > relative to the vectors |p, ¢ > 
[cf comment (ii) of $1], that is, the Fourier transforms y, (p) of y, (r). The 
probability d°P (p, +) of the momentum being p to within d*p and of the spin 
component along Oz being + h/2 is given by: 


Pp +)=|<p, + | b> ap =p, (pp (C-39) 


The various measurements that we have envisaged until now are all 
“complete” in the sense that they each relate to a C.S.C.O. For “incomplete 
measurements”, several orthogonal states correspond to the same result, and it is 
necessary to sum the squares of the moduli of the corresponding probability 
amplitudes. 

For example, if one does not seek to measure its spin, the probability 4*2 (r) 
of finding the electron in the volume d?r in the neighborhood of the point r is 
equal to: 


PR) = E wm? + fy)? ] dr (C-40) 


This is because two orthogonal state vectors, |r, + > and |r, — >, are associated 
with the result { x, y, z }, their corresponding probability amplitudes being y, (r) 
and Y. (r). 

Finally, let us calculate the probability 2, that the spin component along Oz 
is + hj2 (one is not seeking to measure the orbital variables). There exist an 
infinite number of orthogonal states, for example all the |r, + > with arbitrary r, 
which correspond to the result of the measurement. One must therefore sum over 
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all possible values of r the squares of the moduli of the amplitudes < r, + |Y > = Y, (r), 
which gives : 


P, = | rly top (C-41) 


Of course, if we are considering the component of the spin along Ox instead of 
along Oz, we integrate the result (C-38) over all space. These ideas generalize 
those of $B-2 of chapter IV, where we considered only the spin observables since 
the orbital variables could be treated classically. 


References and suggestions for further reading: 


History of the discovery of spin and references to original articles : Jammer (4.8), 
$34, 

Evidence of spin in atomic physics : Eisberg and Resnick (1.3), chap. 8 ; Born (11.4). 
chap. VI; Kuhn (11.1), chap. III, 89 A.S, A.6 and F ; see references of chapter IV relating 
to the Stern-Gerlach experiment. 

The spin magnetic moment of the electron: Cagnac and Pebay-Peyroula (1.2) 
chap. XII; Crane (11.16). 

The Dirac equation : Schiff (1.18), chap. 13; Messiah (1.17). chap. XX: Bjorken 
and Drell (2.6), chaps. 1 to 4. 

The Lorentz group : Omnes (16.13), chap. 4; Bacry (10.31), chaps. 7 and 8. 

Spin | particles : Messiah (1.17), § XIMM1.21. 
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Severa) complements concerning spin , properties can be found at the end of chapter IV. 


too 


This ts why chapter IX has only two complements. 


Aix: ROTATION OPERATORS Aix is a continuation of complement By, It 
FOR A SPIN 1/2 PARTICLE studies in detail the relationship between the 
spin 3 angular momentum and the geometric 
rotations of this spin. Moderately difficult. Can 
be omitted upon a first reading. 


Bix: EXERCISES Bx: exercise 4 is worked out in detail. It studies 
the polarization of a beam of spin 4 particles 
caused by their reflection from a magnetized 
ferromagnetic material. This method is actually 


used in certain experiments. 
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Complement Ajx 


ROTATION OPERATORS FOR A SPIN 1/2 PARTICLE 


1. Rotation operators in state space 


a. Total angular momentum 
b. Decomposition of rotation operators into tensor products 


2. Rotation of spin states 


a. Explicit calculation of the rotation operators in € 
b. Operator associated with a rotation through an angle of 2n 
c. Relationship between the vectorial nature of S and the behavior of a spin state upon rotation 


3. Rotation of two-component spinors 


We are going to apply the ideas about rotation introduced in complement By, to 
the case of a spin 1/2 particle. First, we shall study the form that rotation operators 
take on in this case. We shall then examine the behavior, under rotation, of the ket 
representing the particle’s state and of the two-component spinor associated with it. 


1. Rotation operators in state space 
a. TOTAL ANGULAR MOMENTUM 


A spin 1/2 particle possesses an orbital angular momentum L and a spin 
angular momentum S. It is natural to define its total angular momentum as the 
sum of these two angular momenta: 


J=L+S (1) 


This definition is clearly consistent with the general considerations discussed 
in complement B,,. It insures that not only R and P, but also S, be vectorial 
observables. (To test this, it is sufficient to calculate the commutators between 
the components of these observables and those of J; cf. $5-c of complement By). 


b. DECOMPOSITION OF ROTATION OPERATORS INTO TENSOR PRODUCTS 


In the state space of the particle under study, the rotation operator R,(a) 
is associated with the geometrical rotation @,(«) through an angle « about the unit 
vector u (cf. complement By, $4): 


Ro) = e 0% (2) 
where J is the total angular momentum (1). 
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Since L acts only in $, and S only in &, (which implies, in particular, that all 


components of L commute with all components of S), we can write R,(«) in the 
form of a tensor product : 


Rola) = "R,(a) O OR) (3) 
where: 

OR (a) = e77" (4) 
and: 

OR (a) = eas (5) 


are the rotation operators associated with %,(a) in é, and é, respectively. 
Consequently, if one performs the rotation %, (a) on a spin 1/2 particle whose 
state is represented by a ket which is a tensor product: 


Iy) =|e> @|x (6) 
with: 

lo) es, 

|x) E£, (7) 
its state after rotation will be: 

Iy = Ra) | y>) = [PRE] 0>] 9 [PRE] (8) 


The spin state of the particle is therefore also affected by the rotation. This is what 
we are going to study in more detail in § 2. 


2. Rotation of spin states 


We have already studied ($3 of complement B,,) the rotation operators "R 
in the space &,. Here we are interested in the operators ''R which act in the spin 
state space 6. 


a. EXPLICIT CALCULATION OF THE ROTATION OPERATORS IN Ĝĝ, 
As in chapter [X, set: 

h 

“6 9 

; (9) 

We want to calculate the operator : 


(OR (a) 2 os e am (10) 


To do this, let us use the definition of the exponential of an operator: 
OR (x)= 1 EE ee da u)? + T ouy + 
2 2! 2 l oon! 2 j 
(11) 
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Now, applying identity (B-12) of chapter IX, we immediately see that: 


(o.u =u = 1 (12) 
which leads to: 
(o. uy = 1 if n is even (13) 
o.u_ ifnis odd 


Consequently, if we group together the even and odd terms respectively, expan- 
sion (11) can be written : 


(OR (a) = [ = a(5) +.. + ar (3) ° + sl 


that is, finally : 


Rala) = cos = —io.u sin5 | (15) 


17 aa a 


It will be very easy to calculate the action of the operator “)R, in this form, on any 
spin state. 
Using this formula, we can write the rotation matrix R('/2(a) explicitly 


in the {| + >, | — > } basis, since we already know [formulas (B-9) of chapter IX] 
the matrices which represent the o,, c, and c, operators. We find: 
cosZ — iu, sin= (— iu, — u,) sin= 
(1/2) 2 úl 2 E á 2 
Roa) = a : (16) 
(— iu, + u,) sin 5 cos 5 + iu, sin 5 


where u,, u, and u, are the cartesian components of the vector u. 


b. OPERATOR ASSOCIATED WITH A ROTATION 
THROUGH AN ANGLE OF 27 


If we take 2z for the angle of rotation a, the geometrical rotation 2,(27) 
coincides, whatever the vector u may be, with the identity rotation. However, 
if we set a = 27 in formula (15), we see that: 


(OR (Qn) = — 1 (17) 
whereas : 
R,(0) = 1 (18) 


The operator associated with a rotation through an angle of 27 is not the identity 
operator, but minus this operator. The group law is therefore conserved only 
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locally in the correspondence between geometrical rotations and rotation operators 
in &, [see discussion in complement B,,, comment (iii) of $3-c-y |. This is due to 
the half-integral value of the spin angular momentum of the particle which we are 
considering. 

The fact that the spin state changes sign during a rotation through an angle 
of 27 is not disturbing, since two state vectors differing only by a global phase factor 
have the same physical properties. It is more important to study the way in which 
an observable A transforms during such a rotation. It is easy to show that: 

A’ = OR, (27) A R27) = A (19) 


This result is quite satisfying since a rotation through 2x cannot modify the 
measuring device associated with A. Consequently, the spectrum of A’ must remain 
the same as that of A. 


COMMENT: 
We showed in complement By, [comment (iii) of $3-c-y] that : 
OR, (27) = 1 (20) 
Consequently, in the global state space $ = &, ® &,, as in &,, we have: 
R,(27) = OR, 27) Q CR (27) = — 1 (21) 
c. RELATIONSHIP BETWEEN THE VECTORIAL NATURE 


OF S AND THE BEHAVIOR OF A SPIN STATE UPON ROTATION 


Consider an arbitrary spin state | y >. We showed in chapter IV ($B-1-c) that there must 
exist angles 0 and @ such that | y > can be written (except for a global phase factor which has no 
physical meaning): 


0 y 0 
|x) =e e? cos >| +> +e? sin | => (22) 


|x > then appears as the eigenvector associated with the eigenvalue + h/2 of the compo- 
nent S . y of the spin S along the unit vector y defined by the polar angles 6 and py. Now let us 
perform an arbitrary rotation on the state | y >. Let us call v’ the result of the transformation 
of y by the rotation being considered. Since S is a vectorial observable, the state | y' > after the 
rotation must be an eigenvector, with the eigenvalue + h/2, of the component S . v’ of S along 
the unit vector v’ (cf. complement By, $5): 


o a SR ee (23) 
with: 

y’ = R y (24) 
We shall be satisfied with verifying this for a specific case (cf. fig. 1). Choose for v the unit vector e. 


of the Oz axis, and for v’ an arbitrary unit vector, with polar angles @ and e. v' is obtained from 
v = e, bya rotation through an angle 0 about the unit vector u, which is fixed by the polar angles : 


T 
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Thus we must show that : 


ERAH = |H (26) 


y = 

FIGURE | 
/ A rotation through an angle Y about u brings 

the vector y = e, onto the unit vector v’, with 

4 polar angles 0 and y. 

The cartesian components of the vector u are: 

u, = — sing 

u, = COS p (27) 

u,=0 


so the operator '"R,(0) can be written, using formula (15): 


0 0 
Ra (0)= cos 5 —io.u sin; 


= cos— — i (— 0, sing +o, cos q) sins 
6 1 . 0 
= cos — 5 (0, e-o. e”) sin; (28) 
with: 
04 =0,+ io, (29) 


Now we know [cf. formulas (B-7) of chapter IX] that: 


a,|+> =0 
ao_|+> =2|-) (30) 


The result of the transformation of the ket | + > by the operator 'R,(0) is therefore 
0 ._ . O 
SRL) +> = cos 5 | +> +e” sinz | -> (31) 
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We recognize, to within a phase factor, the ket | + > ,.[cf. formula (22)]: 
PRO) | +> = e'?? | 4+), (32) 


3. Rotation of two-component spinors 


We are now prepared to study the global behavior of a spin 1/2 particle under 
rotation. That is, we shall now take into account both its external and internal 
degrees of freedom. 

Consider a spin 1/2 particle whose state is represented by the ket | y > of the 
state space $ = £, Q 6, . The ket | y > can be represented by the spinor [w](r), 
having the components: 


wr) =< r e| y> (33) 


If we perform an arbitrary geometrical rotation 2 on this particle, its state then 
becomes: 


Iy =R|y> (34) 
where: 
R= "RO OR (35) 


is the operator associated, in &, with the geometrical rotation 4. How is the spinor, 
[w’](r), which corresponds to the state | y” >, obtained from [y] (r)? 
In order to answer this question, let us write the components y; (r) of [w’]: 


wir) =< r e| y> =< r, e| R|y> (36) 


We can find the components of [y ](r) by inserting the closure relation relative to 
the { |r’, e” > } basis between R and | y >: 


Wir) = 2 fer r E| R| e y< <r e ]y> (37) 


Now, since the vectors of the { | r, > } basis are tensor products, the matrix elements 
of the operator R in this basis can be decomposed in the following manner: 


(re|R|re)=<r]OR|ro< a| ORE (38) 
We already know [cf. complement By;, formula (26) ] that: 

<r | PR|r> =< B'rlr’> = ofr’ — (2~'r)] (39) 
Consequently, if we set: 

< e| R |e) = RY (40) 
formula (37) can finally be written: 

wt) =F RLY (AO) A (41) 

| E 
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that is, explicitly : 


y”, (r) RU» RG!) Y (B2-'r) 
= (42) 
w'_(r) RO) RU?) w_(#-'r) 


Thus we obtain the following result : each component of the new spinor [y*] 
at the point r is a linear combination of the two components of the original spinor [y | 
evaluated at the point 2” *r (that is, at the point that the rotation maps into r)*. 
The coefficients of these linear combinations are the elements of the 2 x 2 matrix 
which represents '"R in the { | + >, | — > } basis of €, [cf. formula (16)]. 


References and suggestions for further reading: 


Feynman III (1.2), chap. 6; chap. 18, $18-4 and added note 1; Messiah (1.17), 
App. C; Edmonds (2.21), chap. 4. 

Rotation groups and SU(2): Bacry (10.31), chap. 6; Wigner (2.23), chap. 15; 
Meijer and Bauer (2.18), chap. 5. 

Experiments dealing with rotations of a spin 1/2: article by Werner et al. (11.18). 


* Note the close analogy between this behavior and that of a vector field under rotation. 
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Complement B,x 


EXERCISES 


1. Consider a spin 1/2 particle. Call its spin S, its orbital angular momentum L 
and its state vector | y >. The two functions y, (r) and y _ (r) are defined by: 


Wir) =< r, + |W) 


Assume that: 
1 
) = R(r)| Y90, p) + — YO, 
yat) | Ya RE J 
y= wall 1(0, 9) — Y90. 0)] 


where r, 0, y, are the coordinates of the particle and R(r) is a given function of r. 
a. What condition must R(r) satisfy for | yw > to be normalized ? 


b. S, is measured with the particle in the state | y >. What results can be 
found, and with what probabilities? Same question for L., then for S,. 


c. A measurement of L?, with the particle in the state | y >, yielded zero. 
What state describes the particle just after this measurement? Same question if the 
measurement of L? had given 2h?. 


2. Consider a spin 1/2 particle. P and S designate the observables associated 
with its momentum and its spin. We choose as the basis of the state space the 
orthonormal basis | p,, Py» Pz, + > of eigenvectors common to P,, P,, P, and S, 
(whose eigenvalues are, respectively, p,, p,, p, and + h/2). 

We intend to solve the eigenvalue equation of the operator A which is 
defined by : 


A=S.P 


a. Is A Hermitian? 


b. Show that there exists a basis of eigenvectors of A which are also eigen- 
vectors of P,, P,, P,. In the subspace spanned by the kets | Px» Py» Pz» + >, where 
Px, Py» P; are fixed, what is the matrix representing A ? 

c. What are the eigenvalues of A, and what is their degree of degeneracy ? 
Find a system of eigenvectors common to A and P,, Pako 
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3. The Pauli Hamiltonian 


a . _ A 
The Hamiltonian of an electron of mass m, charge g, spin 5 6 (0,, Oy, 0,: 
Pauli matrices), placed in an electromagnetic field described by the vector poten- 


tial A(r, 1) and the scalar potential U(r, 1), is written: 


A — 2 qħ 
H = 7 [P — gA(R, t)]? + qU(R, t) — Sms B(R, t) 


The last term represents the interaction between the spin magnetic moment E 6 
m 
and the magnetic field B(R, t) = V x A(R, t). 
Show, using the properties of the Pauli matrices, that this Hamiltonian can 
also be written in the following form (“the Pauli Hamiltonian”): 


H => (0. [P — q4A(R, 1) }* +qUIR, 1) 


4. We intend to study the reflection of a monoenergetic neutron beam which is 
perpendicularly incident on a block of a ferromagnetic material. We call Ox the 
direction of propagation of the incident beam and yOz the surface of the ferro- 
magnetic material, which fills the entire x > 0 region (see figure). Let each incident 
neutron have an energy E and a mass m. The spin of the neutrons is s = 1/2 and 
their magnetic moment is written M = yS (y is the gyromagnetic ratio and S is the 
spin operator). 


incident neutrons 
—— 


The potential energy of the neutrons is the sum of two terms: 

— the first one corresponds to the interaction with the nucleons of the sub- 
stance. Phenomenologically, it is represented by a potential V(x), defined by V(x) = 0 
for x < 0, V(x) = h > Ofor x > 0. 

— the second term corresponds to the interaction of the magnetic moment 
of each neutron with the internal magnetic field B, of the material (B, is assumed 
to be uniform and parallel to Oz). Thus we have W = 0 for x < 0, W = S- 
for x > 0 (with wọ = — yBp). Throughout this exercise we shall confine ourselves 
to the case: 

hwo 


0< > <h 
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a. Determine the stationary states of the particle which correspond to a 
positive incident momentum and a spin which is either parallel or antiparallel 
to Oz. 


b. We assume in this question that W — ha,/2 < E < Vy + hw,/2. The 
incident neutron beam is unpolarized. Calculate the degree of polarization of the 
reflected beam. Can you imagine an application of this effect ? 


c. Now consider the general case where E has an arbitrary positive value. 
The spin of the incident neutrons points in the Ox direction. What is the direction 
of the spin of the reflected particles (there are three cases, depending on the relative 
values of E and V, + ha,/2)? 


Solution of exercise 4 
a. The Hamiltonian H of the particle is: 


p? 
H = 5> +V(xX)+W (1) 


V(X), which acts only on the orbital variables, commutes with S_. Since W is 
proportional to S,, it also commutes with this operator. Furthermore, V(X) com- 
mutes with P, and P., as well as with W (obviously, since W acts only on the spin 
variables). We can therefore find a basis of eigenvectors common to H, S., Paba 
which can be written: 


Em...» =|) 90 1P,>90|p.>0|+> (2) 
with: 


pies, 
| Py >€ 8,3 P,| Py > = Py| Py> 
|p. >€&,3P.|p,> = p¿|p. > 


h 
}+>e&5S,)4>= 45/ +> (3) 


where the ket | yj > is a solution of the eigenvalue equation: 


P: los a, ho, ty + 
ESO Es (py + pz) t=" | |e > = El ve > (4) 


We assume in the statement of the problem that the neutron beam is normally 
incident, so we can set p, = p, = 0. Let pz (x) = < x | pf > be the wave function 
associated with | gf >; it satisfies the equation: 


G A ee ee E(x) = E pe (x) (5) 


Thus the problem is reduced to that of a classical one-dimensional “square well” : 
reflection from a “potential step” (cf. complement H,). 
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In the x < 0 region, V(x) is zero and the total energy E (which is positive) 
is greater than the potential energy. We know in this case that the wave function 
is a Superposition of imaginary oscillatory exponentials: 


olx) = A, e" + Be if x <0 (6) 


with: 
p= 
2m 


a VR (7) 


A, gives the amplitude of the wave associated with an incident particle having 
a spin either parallel or antiparallel to Oz. B, gives the amplitude of the wave 
associated with a reflected particle for the same two spin directions. 

In the x > 0 region, V(x) is equal to K and, depending on the relative values 
of E and K + hw,/2, the wave functions can behave like oscillatory or damped 
exponentials. We shall consider three cases: 


(1) If E> ve + 220 we set: 


and the transmitted wave behaves like an oscillatory exponential : 
ot (x) = C, e™+* if x>0 (9) 


Moreover, the continuity conditions for the wave function and its derivative imply 
[cf. complement H,, relations (13) and (14)]: 


B,_k-K; Cy _ ek (10) 
A, k +k, A, ktk, 
(ii) If, onthe other hand, E < V, — Mo. we must introduce the quantities p, : 
dd 
¡2m hw 
= j/—(vV+—2- (11 
P+ TE ( 0 > ) ) 
and the wave in the x > 0 region is a real, damped exponential (evanescent wave): 
pa lx) = D; ee” if x >0 (12) 


with, in this case [cf. complement H,, equations (22) and (23)|: 


Bee, Pe (13) 
A, k+ip,’ A, k+ip, 


hw hw 
(iii) Finally, in the intermediate case Vy — Ea <E<V,+ A we have: 


plx) = D, e *e if x>0 (14-a) 
pg (x) = C_e*-* if x>0 (14-b) 
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[definitions (8) and (11) of k” and p, are still valid]. Depending on the spin orien- 
tation, the wave is either a damped or an oscillatory exponential. We then have: 


B k — ip, D 2k 

E 7 E 15- 
A, kt+ip,’ A, k+ip, ee) 
B k-k C_ 2k 

Se a a E 15-b 
A k +k? A_ Atk ( ) 


hwo 

7° 
above. If the projection onto Oz of the incident neutron spin is equal to h/2, 
the corresponding reflection coefficient is: 


b. When V, = <E< VW% + we are in the situation of case (iii) 


= 1 (16) 


On the other hand, if the projection of the spin onto Oz is equal to — h/2, the 
reflection coefficient is no longer 1, since it is given by: 


2 2 
k — k 
= (se) dl a 


Thus we see how the reflected beam can be polarized since, depending on the 
direction of its spin, the ngutron has a different probability of being reflected. An 
unpolarized incident beam can be considered to be formed of neutrons whose spins 
have a probability 1/2 of being in the state | + > and a probability 1/2 of being 
in the state | — >. Taking (16) and (17) into account, we see that the probability 


R- =| 


AL 


oe PA 
TFR” 


TER Therefore, the degree of polarization of the 


that a particle of the reflected beam will have its spin in the state | + > is 


while for the state | — > it is 


reflected beam is: 


T= oe en 2kk"_ (18) 
1 +R kK +k? 
In practice, reflection from a saturated ferromagnetic substance is actually 
used in the laboratory to obtain beams of polarized neutrons. To increase the degree 
of polarization obtained, the beam is made to fall obliquely on the surface of the 
ferromagnetic mirror; thus, the theoretical results obtained here are not directly 
applicable. However, the principle of the experiment is the same. The ferromagnetic 
substance chosen is often cobalt. When cobalt is magnetized to saturation, one can 
obtain high degrees of polarization T (T > 80 %). Note, furthermore, that the same 
neutron beam reflection device can serve as an “analyzer” as well as a “polarizer” 
for spin directions. This possibility has been exploited in precision measurements 
of the magnetic moment of the neutron. 
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c. Consider a neutron whose momentum, of magnitude p = hk, is parallel 
to Ox. Assume that the projection < S, > of its spin is equal to h/2. Its state is 
[cf. chap. IV, relation (A-20)]: 


E OL] +> + [=>] (19) 
with: 
Se Oa (20) 


How can we construct a stationary state of the particle in which the incident wave 
has the form (19)? We simply have to consider the state: 


l 7 
| vs > = Bll Eoo > + 90.0 >] (21) 
which is a linear combination of two eigenkets of H defined in (2), associated with 


the same eigenvalue E = p?/2m. The part of the ket | Ys > which describes the 
reflected wave is then: 


[<p> s-r la+ +81 (22) 


Z 


where B, and B_ are given, depending on the case, by (10), (13) or (15) (A, and A_ 
being replaced by |). Let us calculate, for a state such as (22), the mean value < S >. 
Since this state is a tensor product, the spin variables and the orbital variables are 
not correlated. Therefore, < S > can easily be obtained from the spin state vec- 
tor B, | + >+ B. | — >, which gives: 


h B*B_ + B*B 
SS ee e 23-a 
tee 2 |B,|? + |B_|? eee) 


i(B*B, — B*B_) 


23-b 
B.P + BP a 


h 
ae 


(S,) = h 1B,) —]B-P (23-c) 


218," + |B_|? 


Three cases can then be distinguished: 

(i) IFE > KW + hew,/2, we see from (10) that B, and B_ are real. Formulas (23) 
then show that < S, > and < S, > are not zero but that < S, > = 0. Upon reflection 
of the neutron, the spin has thus undergone a rotation about Oy. Physically, it is 
the difference between the degrees of reflection of neutrons whose spin is parallel 
to Oz and those whose spin is antiparallel to Oz which explains why the < S. > com- 
ponent becomes positive. 
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(ii) If E < K — hw /2, equations (13) show that B, and B_ are not real: 
they are two complex numbers having different phases but the same modulus. 
According to (23), we have, in this case, < S, X = 0 but < S, > 40and<S, > # 0. 
Upon reflection of the neutron, the spin thus undergoes a rotation about Oz. The 
physical origin of this rotation is the following : because of the existence of the 
evanescent wave, the neutron spends a certain time in the x > 0 region; the 
Larmor precession about B, that it undergoes during this time accounts for the 
rotation of its spin. 

(iii) If Y, — howp/2 < E < Y, + hw /2, B, is a complex number while B_ is 
a real number, and their moduli are different. None of the spin components, < S, >, 
< S, > or < S, >, is then zero. This rotation of the spin upon reflection of the neutron 
is explained by a combination of the effects pointed out in (/) and (i/). 
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A. INTRODUCTION 


1. Total angular momentum in classical mechanics 


Consider a system of N classical particles. The total angular momentum L 
of this system with respect to a fixed point O is the vector sum of the individual 
angular momenta of the N particles with respect to this point O: 


N 
v-Y¢ (A-1) 
i=1 
with: 
2, =r; x p; (A-2) 


The time derivative of £ is equal to the moment with respect to O of the external 
forces. Consequently, when the external forces are zero (an isolated system) or all 
directed towards the same center, the total angular momentum of the system (with 
respect to any point in the first case and with respect to the center of force in the 
second one) is a constant of the motion. This is not the case for each of the individual 
angular momenta £, if there are internal forces, that is. if the various particles 
of the system interact. 

We shall illustrate this point with an example. Consider a system composed 
of two particles, (1) and (2), subject to the same central force field (which can be 
created by a third particle assumed to be heavy enough to remain motionless at 
the origin). If these two particles exert no force on each other, their angular 
momenta #, and .£, with respect to the center of force O are both constants of 
the motion. The only force then acting on particle (1), for example, is directed 


: se d 
towards O; its moment with respect to this point is therefore zero, as is T Fi 


On the other hand, if particle (1) is also subject to a force due to the presence of 
particle (2), the moment with respect to O of this force is not generally zero, and, 
consequently, £, is no longer a constant of the motion. However, if the interaction 
between the two particles obeys the principle of action and reaction, the moment 
of the force exerted by (1) on (2) with respect to O exactly compensates that of the 
force exerted by (2) on (1): the total angular momentum is conserved over time. 

Therefore, in a system of interacting particles, only the total angular momentum 
is a constant of the motion: forces inside the system induce a transfer of angular 
momentum from one particle to the other. Thus we see why it is useful to study 
the properties of the total angular momentum. 
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2. The importance of total angular momentum 
in quantum mechanics 


Let us treat the preceding example quantum mechanically. In the case of two 
non-interacting particles, the Hamiltonian of the system is given simply, in the 
[| r,, r, > ) representation: 


H, = H, + H, (A-3) 
with: 
h2 
H, = "Ont + V(r,) 
h? 
H, = ae + V(r.) (A-4) 


[ 1, and u, are the masses of the two particles, V(r) is the central potential to which 
they are subject, and 4, and 4, denote the Laplacian operators relative to the 
coordinates of particles (1) and (2) respectively]. We know from chapter VII 
($A-2-a) that the three components of the operator L, associated with the angular 
momentum £, of particle (1) commute with H,: 


[L,, H] =0 (A-5) 


Also, all observables relating to one of the particles commute with all those 
corresponding to the other one; in particular : 


[L,, H,] =0 (A-6) 


From (A-5) and (A-6), we see that the three components of L, are constants of the 
motion. An analogous argument is obviously valid for L,. 

Now assume that the two particles interact, and that the corresponding 
potential energy v(|r, — r,|) depends only on the distance between them |r, — r,|* : 


lr = r,| = V(x, — x2)? +i — ya)” + (21 — 22) (A-7) 
In this case, the Hamiltonian of the system is: 


where H, and H, are given by (A-4). According to (A-5) and (A-6), the commutator 
of L, with H reduces to: 


[L,, A] = [L,, v(|r = r>|)] (A-9) 


that is, for the component L,,, for example: 


iz» 
h Ov Ov 
[Li H] = [L,,. o(|r, = r|)] = He ay, — Y =) (A-10) 


* The corresponding classical forces then necessarily obey the principle of action and reaction. 
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Expression (A-10) is generally not zero: L, is no longer a constant of the motion. 
On the other hand, if we define the total angular momentum operator L by an 
expression similar to (A-1): 


L= L, +L, (A-11) 


we obtain an operator whose three components are constants of the motion. For 
example, we find : 


[£.. H] = [L + Laz, H] (A-12) 
According to (A-10), this commutator is equal to: 


[L., H] T [Li. F Laz H] 


h Ov ov Ov Ov 
aa E (A-13) 


But, since v depends only on |r, — r,|, given by (A-7), we have: 


ov -pêl =n iy Xe (A-14-2) 


EA ax, lr, —r,| 

Ov _ 0 ]r, -e _ A 7 A 

x O A T lr, — r] (oer) 
ôv ôv Ov ôv 


and analogous expressions for —-,—-,— an 
E á Oy, Oy,’ 02, 02, 


considered as a function of a single variable). Substituting these values into (A-13): 


(v is the derivative of v, 


[L,, H] = 7 lr, a F Lo, — ya) — y(x, — x3) 
+ xy, — yr) — Yale x} 
=0 (A-15) 


We therefore arrive at the same conclusion as in classical mechanics. 

Until now we have implicitly assumed that the particles being studied had 
no spin. Now let us examine another important example: that of a single particle 
with spin. First, we assume that this particle is subject only to a central potential V(r). 
Its Hamiltonian is then the one studied in $ A of chapter VII. We know that the 
three components of the orbital angular momentum L commute with this Hamil- 
tonian. In addition, since the spin operators commute with the orbital observables, 
the three components of the spin S are also constants of the motion. But we shall 
see in chapter XII that relativistic corrections introduce into the Hamiltonian 
a spin-orbit coupling term of the form: 


Bso = E(r)L.S (A-16) 


where ¿(») is a known function of the single variable r (the physical meaning of this 
coupling will be explained in chapter XII). When this term is taken into account, 
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L and S no longer commute with the total Hamiltonian. For example*: 


[L Hso] = ¿(r) [L.. LS, F L,S, + L.S,] 
= Er) (RLS, — ihL,S,) (A-17) 


and, similarly : 


S,, Hgo] = &(r) ES.. L.S, + L,S, + L.S] 
z 0 y y 
= ¿(r) (ALS, — ihL,S,) (A-18) 


However, if we set: 
J=L+S (A-19) 


the three components of J are constants of the motion. To see this, we can simply add 
equations (A-17) and (A-18): 


[7> Hso] = [L + S., Hgo] =0 (A-20) 


(an analogous proof could be given for the other components of J). The operator J 
defined by (A-19) is said to be the total angular momentum of a particle with spin. 

In the two cases just described, we have two partial angular momenta J, 
and J,, which commute. We know a basis of the state space composed of eigen- 
vectors common to J?, /,,, J2, J,. However, J, and J, are not constants of the 
motion, while the components of the total angular momentum: 


J=J, +J, (A-21) 


commute with the Hamiltonian of the system. We shall therefore try to construct, 
using the preceding basis, a new basis formed by eigenvectors of J? and J,. The 
problem thus posed in general terms is that of the addition (or composition) of two 
angular momenta J, and J,. 

The importance of this new basis, formed of eigenvectors of J? and J,, is easy 
to understand. To determine the stationary states of the system, that is, the 
eigenstates of H, it is simpler to diagonalize the matrix which represents H in this 
new basis. Since H commutes with J? and J., this matrix can be broken down into 
as many blocks as there are eigensubspaces associated with the various sets of 
eigenvalues of J? and J, (cf. chap. II, $ D-3-a). Its structure is much simpler than 
that of the matrix which represents H in the basis of eigenvectors common to J%, 
Ji, 53, J,,, since neither J,, nor J,, generally commutes with H. 

We shall leave aside for now the problem of the diagonalization of H (whether 
exact or approximate) in the basis of eigenstates of J? and J,. Rather, we shall 
concentrate on the construction of this new basis from the one formed by the 
eigenstates of J?, Jin, J}, J,,. A certain number of physical applications (many- 
electron atoms, fine and hyperfine line structure, etc.) will be considered after we 
have studied perturbation theory (complements of chapter XI and chapter XII). 


* To establish (A-17) and (A-18), one uses the fact that L, which acts only on the angular 
variables 6 and @, commutes with ¿(r), which depends only on r. 
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We shall begin ($B) with an elementary treatment of a simple case, in which 
the two partial angular momenta we wish to add are spin 1/2’s. This will allow us 


to familiarize ourselves with various aspects of the problem, before we treat, in $C, 
the addition of two arbitrary angular momenta. 


B. ADDITION OF TWO SPIN 1/2'S. ELEMENTARY METHOD 


1. Statement of the problem 


We shall consider a system of two spin 1/2 particles (electrons or silver atoms 
in the ground state, for example), and we shall be concerned only with their spin 
degrees of freedom. Let S, and S, be the spin operators of the two particles. 


a. STATE SPACE 


We have already defined the state space of such a system. Recall that it is 
a four-dimensional space, obtained by taking the tensor product of the individual 
spin spaces of the two particles. We know an orthonormal basis of this space, which 
we shall denote by {| ¢,, €, > }, that is, explicitly : 


flene t= |+ +). 


+, —>.|—. +4), 


=p (B-1) 


These vectors are eigenstates of the four observables Si, S,,, Sí, S,. (which are 
actually the extensions, into the tensor product space, of operators, defined in each 
of the spin spaces): 


2 3 3 
Si | éng > = Si ene = Gh? |e. 82) (B-2-a) 
h 
Si.) 8, >= £15 |En &2 > (B-2-b) 
h 
Sy, | 8182 > = a | éis £2 > (B-2-c) 


Sí, S$, S,, and S,, constitute a C.S.C.O. (the first two observables are actually 
multiples of the identity operator, and the set of operators remains complete even 
if they are omitted). 


b. TOTAL SPIN S. COMMUTATION RELATIONS 


We define the total spin S of the system by: 
S=S, +8, (B-3) 


It is simple, knowing that S, and S, are angular momenta, to show that S is as well. 
We can calculate, for example, the commutator of S, and S,: 


[S.. S,] a [Six + Szo Siy + Sa] 
= [Si S] + [S,,. Say] 
inS,. + ihs,, 
= NS. (B-4) 
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The operator S? can be obtained by taking the (scalar) square of equa- 
tion (B-3): 


S? = (S, 48, = S? + S? + 2S, . S, (B-5) 


since S, and S, commute. The scalar product S, . S, can be expressed in terms of 
the operators S,,, S,, and S,,, S,,; it is easy to show that: 


S,. S, E 541.595 F S1,82, + S12522 


1 
= 5 (81,82 - + 8,-84) + 81.82, (B-6) 


Note that, since S, and S, each commute with S? and S2, so do the three 
components of S. In particular, S? and S, commute with S? and S2: 


[S., S1] = [S,,S¿] = 0 (B-7-a) 
[S?, S?] = [S?, S2] = 0 (B-7-b) 


In addition, S, obviously commutes with S,, and S,,: 
LS.. Siz] = LS, S3] =0 (B-8) 
However, S? commutes with neither S,, nor S,, since, according to (B-5): 


[S?, Siz] z [St + S; F 25, $ S», Si] 


= 2[S, . S,, Si] 
= 2[S, 83, + S1 Soy. S12] 
= 2ih(— S,,Sz, + S,,87,) (B-9) 


[this calculation is analogous to the one performed in (A-17) and (A-18)]. The 
commutator of S? with S,, is, of course, equal and opposite to the preceding one, 
so that S, = S,, + S,, commutes with S°. 


c. THE BASIS CHANGE TO BE PERFORMED 


The basis (B-1), as we have seen, is composed of eigenvectors common to the 
C.S.C.O.: 


{ St, S3, Siz S2z } (B-10) 
Also, we have just shown that the four observables : 
Sí, Si, S?, S, (B-11) 


commute. We shall see in what follows that they also form a C.S.C.O. 

Adding the two spins S, and S, amounts to constructing the orthonormal 
system of eigenvectors common to the set (B-11). This system will be different 
from (B-1), since S* does not commute with S,, and S,,. We shall write the vectors 
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of this new basis | S, M >, with the eigenvalues of S? and S? (which remain the 
same) implicit. The vectors | S, M > therefore satisfy the equations: 


S?|S,M > = $3|S,M) =R? |S, M) (B-12-a) 
S2] S,M > = S(S + 1}?] S, M > (B-12-b) 
S,|S,M > = Mh|S,M) (B-12-c) 


We know that S is an angular momentum. Consequently, S must be a positive 
integer or half-integer, and M varies by one-unit jumps between — S and + S. 
The problem is therefore to find what values S and M can actually have, and to 
express the basis vectors | S, M > in terms of those of the known basis. 

In this section, we shall confine ourselves to solving this problem by the elemen- 
tary method involving the calculation and diagonalization of the 4 x 4 matrices 
representing S? and S, in the { | ¢,, e, > } basis. In $C, we shall use another, more 
elegant, method, and generalize it to the case of two arbitrary angular momenta. 


2. The eigenvalues of S, and their degrees of degeneracy 


The observables S? and Sí are easy to deal with: all vectors of the state space 
are their eigenvectors, with the same eigenvalue 3/°/4. Consequently, equa- 
tions (B-12-a) are automatically satisfied for all kets | S, M >. 

We have already noted [formulas (B-7) and (B-8)] that S, commutes with 
the four observables of the C.S.C.O. (B-10). We should therefore expect the basis 
vectors {| &, €, > } to be automatically eigenvectors of S.. We can indeed show, 
using (B-2-b) and (B-2-c), that: 


1 
Sz] 81122) = (Sia + Sa) | En €, > = 5 (6 + £,)h |e, & > (B-13) 
| e,, €, > is therefore an eigenstate of S, with the eigenvalue: 
1 
M =5 (6, +) (B-14) 


Since ¢, and e, can each be equal to + 1, we see that M can take on the values + 1, 
0 and — |. 

The values M =1 and M = — 1 are not degenerate. Only one eigenvector 
corresponds to each of them : | +, + > for the first one and | —, — > for the second 
one. On the other hand, M = 0 is two-fold degenerate: two orthogonal eigenvectors 
are associated with it, | +, — > and | —, + >. Any linear combination of these two 
vectors is an eigenstate of S,, with the eigenvalue 0. 
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These results appear clearly in the matrix which represents S. in the 


{ | e,, €, > } basis. Choosing the basis vectors in the order indicated in (B-1), that 
matrix can be written : 


1 0 0 O 
0 0 0 0 
S.) =h B-15 
(S.) a oo (B-15) 
0 0 O ~1 
3. Diagonalization of $? 


All that remains to be done is to find and then diagonalize the matrix which 
represents S? in the { | ¢,, e, > } basis. We know in advance that it is not diagonal, 


since S? does not commute with S,, and S,.. 


a. CALCULATION OF THE MATRIX REPRESENTING S? 


We are going to apply S? to each of the basis vectors. To do this, we shall use 
formulas (B-5) and (B-6): 


S? = Si + S} + 28,87, + SiS + 8,8), (B-16) 
The four vectors | e,, e, > are eigenvectors of S?, S3, S,, and S,, [formulas (B-2)], 


and the action of the operators S,, and S,, can be derived from formulas (B-7) of 
chapter IX. We therefore find : 


S| + +> = (37 Is 4 +) 
= 2h? | +, +) (B-17-a) 
s 3a oes 1,3 3 
SC| +, ->= qh? ta” | +.-> —5h ay Se | Se +> 
=W[} +,-—>+]-.4+)] (B-17-b) 
2 3 2 3 2 l2 2 
Si- += re +qA a |e A > 
=H[] —,+>+]|+,=)] (B-17-c) 
2.3 = 3 2 3 21 _ 1, 
S?| -, > = (Gn ae | -, EESE | ey 
= 2h? | a E (B-17-d) 
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The matrix representing S? in the basis of the four vectors | ¢,, e, >, arranged 
in the order given in (B-1), is therefore: 


2/0 0 0 
—- =F E 
, 01 1,0 
(S%) =4* | | (B-18) 
O'I 110 
0 0 0/2 
COMMENT: 


The zeros appearing in this matrix were to be expected. S? commutes 
with S,, and therefore has non-zero matrix elements only between eigenvectors 
of S, associated with the same eigenvalue. According to the results of §2, the 
only non-diagonal elements of S? which could be different from zero are those 
which relate | +, — >to | —, + >. 


b. EIGENVALUES AND EIGENVECTORS OF S? 


Matrix (B-18) can be broken down into three submatrices (as shown by the 
dotted lines). Two of them are one-dimensional: the vectors | +, + > and | =,-> 
are eigenvectors of S?, as is also shown by relations (B-17-a) and (B-17-d). The 
associated eigenvalues are both equal to 2h’. 

We must now diagonalize the 2 x 2 submatrix: 


Bee afl 1 
(S v =P (| ) (B-19) 


which represents S? inside the two-dimensional subspace spanned by | +, — > and 
| =, + >, that is, the eigensubspace of S, corresponding to M = 0. The eigen- 
values Ah* of matrix (B-19) can be obtained by solving the characteristic equation: 


(L= 4? -1= (B-20) 


The roots of this equation are à = 0 and A = 2. This yields the last two eigenvalues 
of S? : 0 and 2h?. An elementary calculation yields the corresponding eigenvectors: 


2 

/2 

= 

/2 
(of course, they are defined only to within a global phase factor ; the coefficients TRE 
insure their normalization). 

The operator S? therefore possesses two distinct eigenvalues : 0 and 2h?. 
The first one is non-degenerate and corresponds to vector (B-21-b). The second one 


is three-fold degenerate, and the vectors | +, + >, | —,— > and (B-21-a) form an 
orthonormal basis in the associated eigensubspace. 


[U +. -> +|. +>] for the eigenvalue 2h? (B-21-a) 


[|+.->—|-.+)] forthe eigenvalue 0 (B-21-b) 
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4. Results: triplet and singlet 


Thus we have obtained the eigenvalues of S? and S., as well as a system of 
eigenvectors common to these two observables. We shall summarize these results 
by expressing them in the notation of equations (B-12). 

The quantum number S of (B-12-b) can take on two values: 0 and 1. The 
first one is associated with a single vector, (B-21-b), which is also an eigenvector 
of S, with the eigenvalue 0, since it is a linear combination of | +, — > and | —, + >»: 
we shall therefore denote this vector by | 0,0 >: 


1 
M0. a EE (B-22) 
| VAL | ] 


Three vectors which differ by their values of M are associated with the value S = 1: 


LT =|+.+> 
1 
10>) =—=[|+.->+]-. +> 
| Fl | ] 
|L=1>=|=,-> (B-23) 


It can easily be shown that the four vectors | S, M > given in (B-22) and (B-23) 
form an orthonormal basis. Specification of S and M suffices to define uniquely 
a vector of this basis. From this, it can be shown that S? and $. constitute a C.S.C.O, 
(which could include S? and S5, although it is not necessary here). 

Therefore, when two spin 1/2°s (s, = 8, = 1/2) are added, the number S which 
characterizes the eigenvalues S(S + 1)A? of the observable S? can be equal either 
to l or to 0. With each of these two values of S is associated a family of (28 +1) 
orthogonal vectors (three for S = 1, one for S = 0) corresponding to the (2S + 1 ) 
values of M which are compatible with S. 


COMMENTS: 


(i) The family (B-23) of the three vectors | 1, M > (M = 1,0, — 1) constitutes 
what is called a tripler; the vector | 0. 0 > is called a singlet state. 

(ii) The triplet states are symmetric with respect to an exchange of two spins, 
whereas the singlet state is antisymunerric. This means that if each vector | £1. £3 > 
is replaced by the vector | £,, ¢, >, expressions (B-23) remain invariant, while 
(B-22) changes sign. We shall see in chapter XIV the importance of this 
property when the two particles whose spins are added are identical. Further- 
more, it enables us to find the right linear combination of | +, — > and 
| —, + > which must be associated with | +, + > and | —, — > (clearly sym- 
metric) in order to complete the triplet. The singlet state, on the other hand. is 
the antisymmetric linear combination of | +, — > and | —, + >, which is 
orthogonal to the preceding one. 


1008 


www.elsolucionario.net 
C. GENERAL METHOD 


C. ADDITION OF TWO ARBITRARY ANGULAR MOMENTA. 
GENERAL METHOD 


1. Review of the general theory of angular momentum 


Consider an arbitrary system, whose state space is &, and an angular momen- 
tum J relative to this system (J can be either a partial angular momentum or the 
total angular momentum of the system). We showed in chapter VI ($C-3) that it is 
always possible to construct a standard basis { | k, j, m > ) composed of eigenvectors 
common to J? and J.: 


Vik jm» = fj + Ih? [k,j m> (C-1-a) 
J,|k,j,m> = mh|k,j,m)> (C-1-b) 


such that the action of the operators J, and J_ obeys the relations: 


J, |k im> =h/ jj +1) — mm t lk jmz+1) (C-2) 


We denote by &(k, j) the vector space spanned by the set of vectors of the 
standard basis which correspond to fixed values of k and j. There are (2j + 1) of 
these vectors, and, according to (C-1) and (C-2), they can be transformed into each 
other by J”, J., J} and J. . The state space can be considered to be a direct sum of 
orthogonal subspaces &(k, j) which possess the following properties: 

(i) E(k, j) is (27 + 1)-dimensional. 

(ii) &(k, j) is globally invariant under the action of J?, J., J,, and, more 
generally, of any function F(J). In other words, these operators have non-zero 
matrix elements only inside each of the subspaces &(k, j). 

(iii) Inside a subspace &(k, j), the matrix elements of any function F(J) of 
the angular momentum J are independent of k. 


COMMENT: 


As we pointed out in $C-3-a of chapter VI, we can give the index k a concrete 
physical meaning by choosing for the standard basis the system of eigenvectors common 
to J*, J, and one or several observables which commute with the three components of J 
and form a C.S.C.O. with J? and J,. If, for example: 


[4,3] = 0 (C-3) 
and if the set { 4, J?, J, }isa C.S.C.O., we require the vectors | k, j, m > to be eigenvectors 
of A: 

A|k.j,m> =a | k, j m> (C-4) 
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Relations (C-1), (C-2) and (C-4) determine the standard basis { | k, j, m > } in this case. 
Each of the &(k, j) is an eigensubspace of A, and the index k distinguishes between the 
various eigenvalues a, , associated with each value of j. 


2. Statement of the problem 


a. STATE SPACE 


Consider a physical system formed by the union of two subsystems (for 
example, a two-particle system). We shall use indices 1 and 2 to label quantities 
relating to the two subsystems. 

We shall assume that we know, in the state space &, of subsystem (1), 
a standard basis { | k,,¿,, m, > } composed of common eigenvectors of J? and J,., 
where J, is the angular momentum operator of subsystem (1): 


Ji | kis jis Mi >= Ai T 1)h? | kis jis Mı > (C-5-a) 
Jiz | kojom > = mA |k jim > (C-5-b) 
Jia |kunjo m >) = n Vil +1) — m (m, + 1) | kojom + 1 > (C-5-c) 


Similarly, the state space &, of subsystem (2) is spanned by a standard basis 
{ | kz, ja, M2) $: 

J} | kaja m, > = Solin + YR? | ky, jy, m, > (C-6-a) 

Jz | kz, ja, mM, > = mAh | ks, ja, Mz > f (C-6-b) 

Jo | kz jm, > = ñil +1) — mam, + 1) | Kos ja. m,+1> (C-6-c) 


The state space of the global system is the tensor product of &, and &,: 
6=6&,@6&, (C-7) 
We know a basis of the global system, formed by taking the tensor product of the 


bases chosen in &, and &,. We shall denote by | k,, kz; Jis J23 mM,» Mz > the vectors 
of this basis: 


[kis Kaiji jzi Mi M> = | Kiji Mi & | kas Jo. M3) (C-8) 


The spaces $, and $, can be considered to be the direct sums of the sub- 
spaces é (ki, ji) and 6,(k,, ja), which possess the properties recalled in $C-1: 


6, = 2 Elk 41) (C-9-a) 


6,= 2 Falko i2) (C-9-b) 


Consequently, & is the direct sum of the subspaces $(k,, k>; jis J2) obtained by 
taking the tensor product of a space &,(k,, ji) and a space &,(k,, j,): 


6 = Y &(ky. kas jas 52) (C-10) 
® 
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with: 
lki, Kk2351,)2) = Elki ji) 9 alkaja) (C-11) 


The dimension of the subspace &(k,, ka; Jis J2) is (2/1 + 1)(2/, + 1). This subspace 
is globally invariant under the action of any function of J, and J, (J, and J, here 
denote the extensions into & of the angular momentum operators originally defined 
in &, and &, respectively). 


b. TOTAL ANGULAR MOMENTUM. COMMUTATION RELATIONS 


The total angular momentum of the system under consideration is defined by : 


where J, and J,, extensions of operators acting in the different spaces &, and ĉ,, 
commute. Of course, the components of J}, on the one hand, and of J,, on the other, 
satisfy the commutation relations which characterize angular momenta. It is easy 
to verify that the components of J also satisfy such relations [the calculation is the 
same as in (B-4)]. 

Since J, and J, each commute with J? and J3, so does J. In particular, J? 
and J, commute with J? and J%: 


[J.J] = [4.52] =0 (C-13-a) 

[J?, Jt] = [32,33] =0 (C-13-b) 
Furthermore, J,, and J,, obviously commute with J,: 

REZA Ea ERA =0 (C-14) 


but not with J? since this last operator can be written in terms of J, and J, in the 
form: 


P=3 +3 4+25,. 5, (C-15) 


and, as in (B-9), J,, and J}, do not commute with J, . J}. We can also transform 
the expression for J? into: 


P=P4+h4+ Wh, + Jida- + Jy da, (C-16) 


c. THE BASIS CHANGE TO BE PERFORMED 


A vector | ki, k2; ji» J2; My, M2 > of basis (C-8) is a simultaneous eigenstate 
of the observables: 


Ji, J$, Jia Jaz (C-17) 
with the respective eigenvalues j (j + 1)A?, jU + 1)A?, m,h, m,h. Basis (C-8) is 
well adapted to the study of the individual angular momenta J, and J, of the two 


subsystems. 
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According to (C-13), the observables: 
F, J.J, (C-18) 


also commute. We are going to construct an orthonormal system of common 
eigenvectors of these observables : this new basis will be well adapted to the study 
of the total angular momentum of the system. Note that this basis will be different 
from the preceding one, since J? does not commute with Jj, and J,. ($b above). 


COMMENT: 


To give a physical meaning to the indices k, and k,, let us assume (comment of 
$ C-1) that we know, in 8,, a C.S.C.O..[ 41,37. Jiz |. where A, commutes with the three 
components of J,, and, in 6,, a C.S.C O.,{ A>. J3, J, }, where A, commutes with the 
three components of J,. We can choose for a standard basis í | ki. ji. m; > } the ortho- 
normal system of eigenvectors common to A,, Jj,and J,.. and for í |k,.j,, m, > 5 the 
orthonormal system of eigenvectors common to 4,. J} and J,.. The set: 


{ Ay, Az: Si. J: Jiz Jy. 3 (C-19) 


then constitutes a C.S.C.O. in 6, whose eigenvectors are the kets (C-8) Since the 
observable A, commutes separately with the components of J, and with those of J,, 
it also commutes with J and, in particular. with J? and J.. The same is, of course, true 
of A,. Consequently, the observables : 


Ai Ay, Mid PS, (C-20) 


commute. We shall see that they in fact form a C.S.C.O.: the new basis we are trying 
to find is the orthonormal system of eigenvectors of this C.S.C.O. 


The subspace &(k,, k,; j,, J2) of & defined in (C-11) is globally invariant under 
the action of any operator which is a function of J, and J,, and, therefore, under 
the action of any function of the total angular momentum J. It follows that the 
observables J? and J., which we want to diagonalize, have non-zero matrix elements 
only between vectors belonging to the same subspace 6(k,, k,:/,./). The matrices 
(which are, in general, infinite) representing J? and J_ in the basis (C-8) are “ biock 
diagonal”, that is, they can be broken down into a series of submatrices, each of 
which corresponds to a particular subspace $(k,, kz; jis J2). The problem therefore 
reduces to a change of basis inside each of the subspaces &(k,, ky, jis >), which are 
of finite dimension (2/, + 1)(2/, + 1). 

Moreover, the matrix elements in the basis (C-8) of any function of J, and J, 
are independent of k; and k,. This is therefore true of those of J? and J,. Conse- 
quently, the problem of the diagonalization of J? and J, is the same inside all the 
subspaces &(k,, kai jis Ja) which correspond to the same values of j, and j,. It is for 
this reason that one usually speaks of adding angular momenta ji and j, without 
specifying the other quantum numbers. To simplify the notation. we shall henceforth 
omit the indices k, and k,. We shall denote by 4(/,, /,) the subspace 6 (ki, Kaiji hah 
and by | ji, jz; M,, M, >, the vectors of basis (C-8) belonging to this subspace: 


Elli da) = Elki Kaiji da) (C-21-a) 
|J j2: ys My > = | Kis Kaiji j2: Mi > (C-21-b) 
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Since J is an angular momentum and 6(j,, j2) is globally invariant under the 
action of any function of J, the results of chapter VI recalled above ($C-1) are 
applicable. Consequently, &(/,, j2) is a direct sum of orthogonal subspaces é(k, J), 
each of which is globally invariant under the action of J*, J,, J, and J_: 


êlo j2) = 2 6(k, J) (C-22) 


Thus, finally, we are left with the following double problem: 

(i) Given j; and j,, what are the values of J which appear in (C-22), and how 
many distinct subspaces &(k, J) are associated with each of them? 

(ii) How can the eigenvectors of 3? and J, belonging to é(j,, j2) be expanded 
on the {| jy, j2; mi, m3 > ) basis? 


$C-3 supplies the answer to the first question, and $C-4 to the second. 


COMMENTS: 


(i) We have introduced J, and J, as the angular momenta of two distinct 
subspaces. In fact, we know ($A-2) that we may want to add the orbital and 
spin angular momenta of the same particle. All the discussions and results 
of this section are applicable to this case, with 6, and 6, simply being 
replaced by é, and 6. 

(ii) In order to add several angular momenta, one first adds the first two, then 
the angular momentum so obtained to the third one, and so on until the last 
one has been added. 


3. Eigenvalues of J? and J. 


a. SPECIAL CASE OF TWO SPIN 1/2'S 


First of all, let us again take up the simple problem treated in §B. The 
spaces 6, and $, each contain, in this case, a single invariant subspace, and the 
tensor product space 6, a single subspace &(j,, ja), for which ji = j, = 1/2. 

The results recalled in $C-1 make it very simple to find the values of the 
quantum number S associated with the total spin. The space $ = &(1/2, 1/2) must 
be a direct sum of (2S + 1)-dimensional subspaces &(k, S). Each of these subspaces 
contains one and only one eigenvector of S, corresponding to each of the values 
of M such that |M| < S. Now, we know (cf. $B-2) that the only values taken on 
by M are 1, — l and O, the first two being non-degenerate and the third, two-fold 
degenerate. From this, the following conclusions can be deduced directly: 

(i) Values of S greater than 1 are excluded. For example, for S = 2 to be 
possible there would have to exist at least one eigenvector of S, of eigenvalue 2%. 

(ii) S = 1 occurs (since M = | does) only once : M = | is not degenerate. 


(iii) This is also true for S = 0. The subspace characterized by S = 1 includes 
only one vector for which M = 0, and this value of M is doubly degenerate in the 
space &(1/2, 1/2). 
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The four-dimensional space &(1/2, 1/2) can therefore be broken down into 
a subspace associated with S = | (which is three-dimensional) and a subspace 
associated with S = 0 (which is one-dimensional). 

Using a completely analogous argument, we shall defermine the possible 
values of J in the general case in which j, and j, are arbitrary. 


b. THE EIGENVALUES OF J, AND THEIR DEGREES OF DEGENERACY 


In accordance with the conclusions of §2-c, we shall consider a well-defined 
subspace £ (ji, j2), of dimension (2/, + 1)(27, + 1). We shall assume that j, and j, 
are labeled such that: 


j 2 ho (C-23) 
The vectors | j,, j,; Mı, M, > are already eigenstates of J, : 


J, | fas da} Mm, My = (Jy, + J22) | Jis J2} Mi, M2) 
= (m, + m,)Sh | jis J23 Mi M3) (C-24) 


and the corresponding eigenvalues Mh are such that: 
M=m, +m, (C-25) 
Consequently, M takes on the following values: 


Ji + da Si +) pnl St + Ja =2 » 0... T Us + jz) (C-26) 


To find the degree of degeneracy g, (M) of these values, we can use the 
following geometrical procedure. In a two-dimensional diagram, we associate with 
each vector | ji, J2; m,, m, > the point whose abscissa is m, and whose ordinate 
is m,. All these points are situated inside, or on the sides of, the rectangle whose 
corners are at (ji j2) Gis — jad (— Jis — Ja) and (— ji, Ja). Figure 1 represents 
the 15 points associated with the basis vectors in the case in which j, = 2 andj, = 1 
(the values of m, and m, are shown beside each point). All points situated on the 
same dashed line (of slope — 1) correspond to the same value of M = m, + my. 
The number of such points is therefore equal to the degeneracy g,, ,,(M) of this 
value of M. 

Now consider the various values of M, in decreasing order, tracing the line 
defined by each of them (fig. 1). M = ji +j is not degenerate, since the line it 
characterizes passes only through the upper right-hand corner, whose coordinates 
are (ji, j2): 


Iili +j) = 1 (C-27) 


M = ji + j2 — 1 is doubly degenerate, since the corresponding line contains the 
points (ji, ja — 1) and (j, — 1,/,): 


Dj, joi + Je = 1) =2 (C-28) 
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Pairs of possible values (m,, m,) for the kets | j,, j3; m,, m, >. We have chosen the case in 
which j; = 2 and j, = |. The points associated with a given value of M = m, + m, are situated 
on a straight line of slope — 1 (dashed lines). 


The degree of degeneracy thus increases by one when M decreases by one, until 


we reach the lower right-hand corner of the rectangle (m; = j,,m, = 


— ja) that is, 


the value M = j, — j,. The number of points on the line is then at a maximum and 


is equal to: 


gj, pi 


— jx) = 27, + 1 


(C-29) 


When M falls below j; — jz» 9;, ¡,(M) first remains constant and equal to its 
maximum value as long as the line associated with M cuts across the entire width 


92,1(M) 
== 
ax 
T N 
/ | ON 
f A E A E aa Lo 
T i ] 
| o 
-3 1 2 3 
FIGURE 2 


Value of the degree of degeneracy g, (M) as a function of M. As in figure 1, we have shown 
the case in which j, = 2 and j, = 1. The degree of degeneracy g;, (M) is simply obtained by coun- 


ting the number of points on the corresponding dashed line of figure 1. 
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of the rectangle, that is, until it passes through the upper left-hand corner of the 
rectangle (m; = — ji, Mm, = j3): 

9j,.¡,(M) =2j, + 1 for -(-h)<M <i, —-h (C-30) 
Finally, for M less than — (j; — /,), the corresponding line no longer intersects 
with the upper horizontal side of the rectangle, and g,, ,,(M) steadily decreases 


by one each time M decreases by one, again reaching 1 when M = — (ji + j2) 
(lower left-hand corner of the rectangle). Consequently: 
CRRA M) = Ij pM) (C-31) 


These results are summarized, for j, = 2 andj, = 1, in figure 2, which gives g, (M) 
as a function of M. 


c. THE EIGENVALUES OF J? 


Note, first of all, that the values (C-26) of M are all integral if j, and j, are 
both integral or both half-integral, and all half-integral if one of them is integral 
and the other half-integral. Consequently, the corresponding values of J will also 
be all integral in the first case and all half-integral in the second. 

Since the maximum value attained by M is ji + /,, none of the values 
of J greater than j, + j is found in &(/,, /,) and therefore none appears in the 
direct sum (C-22). With J = ji +j, is associated one invariant subspace (since 
M =j, + j exists) and only one (since M = ji + j, is not degenerate). In this 
subspace (J = ji + ja), there is one and only one vector which corresponds to 
M =j, +j —1; now this value of M is two-fold degenerate in 6(j,,/2); 
therefore, J = ji + j, — l also occurs, and to it corresponds a single invariant 
subspace &(J = ji + j, — 1). 

More generally, we shall denote by p,,_;,(/) the number of subspaces &(k, J) 
of £ (ji. ja) associated with a given value of J, that is, the number of different values 
of k for this value of J (j; and j, having been fixed at the beginning). p;, ; (J) and 
9 ¡, .¡, LM) are very simply related. Consider a particular value of M. To it corresponds 
one and only one vector in each subspace &(k, J) such that J > |M]. Its degree of 
degeneracy g; ¡,(M) in &(j,, /,) can therefore be written: 


Gj, j,(M) = Pj, J om |M]) F Pi, jl = |A4| + 1) 
+ Pj, (J = |M | +2)+... (C-32) 


Inverting, we obtain p,, (J) in terms of g, (M): 


Pill) = 9 ¿AM = J) — 9; [M = 4+ 1) 
= 95 M = — J) = 9; (M = —J—-1) (C-33) 
The results of §C-3-b then enable us to determine simply the values 
of the quantum number J which actually occur in &(/,, ja) and the number of 


invariant subspaces &(k, J) which are associated with them. First of all, we have, 
obviously: 


Pip) =0 for J>j +h (C-34) 
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since g; (M) is zero for |M| >j, +j} Furthermore, according to (C-27) 
and (C-28): 
Pipl =), + ja) = Gj, p (M =j, +j)=l (C-35-a) 
Pial ji thol) 
= Iji ¡ LM =j +j —1l)- 9 ju, ¡ LM =f, +f.) = 1 (C-35-b) 
Thus, by iteration, we find all the values of p,, ,, (J): 


Paal Sji th 2) = h.a (C-36-a) 

vo Pah Sh 2 2) =1 (C-36-b) 
and, finally, according to (C-30): 

Pal) 50 for J<j -jh (C-37) 


Therefore, for fixed /, and j,,that is, inside a given space &(/,, /,), the eigen- 
values of J? are such that* : 


ihl (C-38) 


With each of these values is associated a single invariant subspace é (J), so that 
the index k which appears in (C-22) is actually unnecessary. This means, in particular, 
that if we fix a value of J belonging to the set (C-38) and a value of M which is 
compatible with it, there corresponds to them one and only one vector in &(/, , J2): 
the specification of J suffices for the determination of the subspace & (J), in which 
the specification of M then defines one and only one vector. In other words, J? 
and J, form a C.S.C.O. in &(j/,, /2). 


J=ji +] Sith! a = 2 gts 


COMMENT: 


It can be shown that the number of pairs (J, M) found in & (ji. j,) is 
indeed equal to the dimension (27, + 1)(2/, + 1) of this space. This number 
(if, for example, j} > j,) is equal to: 


tj 
Y (27 +1) (C-39) 
J=d17 Ja 
If we set: 
Jej Hii (C-40) 


it is easy to calculate the sum (C-39): 


dite 2h 
Y (2J +1) 


J=j= 4 i 


RU, — 42) + Gi. +1) +2 


PU, j +) +1] 


It 
o 


2j2(2j, +1) 
2 


= (2j, + Mi, +1) (C-41) 


* Thus far, we have assumed j, > j,, but it is simple to extend the discussion to the opposite 
case j, < j,: all we need to do is invert indices 1 and 2. 
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4. Common eigenvectors of J? and J, 


We shall denote by | J, M > the common eigenvectors of J? and J, belonging 
to the space &(j,, j,). To be completely rigorous, we should have to recall the 
values of j, and j, in this notation, but we shall not write them explicitly, since 
they are the same as in the vectors (C-21-b) of which the | J, M > are linear 
combinations. Of course, the indices J and M refer to the eigenvalues of J? and J,: 


J? | J, MY 
J, |J, M> 


J(J +1}? | J, MY (C-42-a) 
Mh|J,M> (C-42-b) 


and the vectors | J, M ), like all those of the space &(j,, ja), are eigenvectors of J} 
and J? with eigenvalues j,(j, + 1)A? and j,(j, + 1)A? respectively. 


a. SPECIAL CASE OF TWO SPIN 1/2'S 


First of all, we shall show how use of the general results concerning angular 
momenta leads us to the expression for the vectors | S, M > established in $B-3. 
It will not be necessary to diagonalize the matrix which represents S?. By genera- 
lizing this method, we shall then construct ($ 4-b) the vectors | J, M > for the case 
of arbitrary j; and j}. 


a. The subspace &(S = 1) 


The ket | +, + > is, in the state space & = & (1/2, 1/2), the only eigenvector 
of S, associated with M = 1. Since S? and S, commute, and the value M = 1 
is not degenerate, | +, + > must also be an eigenvector of S? ($ D-3-a of chapter II). 
According to the reasoning of $C-3-a, the corresponding value of S must be 1. 
Therefore, we can choose the phase of the vector | S = 1, M = 1 > such that: 


|l,1> =| +, +> (C-43) 


It is then easy to find the other states of the triplet, since we know from the 
general theory of angular momentum that: 


S-|1,1> =A il + 1) — 1 — 1)/1,0> 
=hJ/2|1,0> (C-44) 
Consequently : 


KoE AEE (C-45) 


n/2 


To calculate | 1, 0 > explicitly in the { | e,, e, > } basis, it suffices to recall that 
definition (B-3) of the total spin S implies: 


S_ =S, +S, (C-46) 
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We then obtain: 


|1,0> =——=(S,_ + S,_)] +, +) 


=——[h|-, +> +h| +,- >] 
=—=[|-.+>+]+-)] (C-47) 


Finally, we can again apply S_ to | 1,0, that is, (S,_ + S,_) to expression (C-47). 
This yields: 


„+> +] +,->] 


IG 
1 

as peas] 

Ses (C-48) 


Of course, this last result could have been obtained directly, using an argument 
analogous to the one applied above to | +, + >. However, the preceding calcula- 
tion has a slight advantage : it enables us, in accordance with the general conventions 
set forth in §C-3-a of chapter VI, to fix the phase factors which could appear in 
| 1,0> and | 1, — 1 > with respect to the one chosen for | 1, 1 > in (C-43). 


P. The state|S =0,M =0) 


The only vector | S = 0, M = 0 of the subspace &(S = 0) is determined, 
to within a constant factor, by the condition that it must be orthogonal to the three 
vectors | 1, M > which we have just constructed. 

Since it is orthogonal to | 1,1 > = 
must be a linear combination of | +,-)> and | —, + yi 


which will be normalized if: 
<0,0]0,0> = Ja? + (pl? = 1 (C-50) 


We now insist that its scalar product with | 1, 0 > [cf. (C-47)] be zero: 


(a +$8)=0 (C-51) 


Ss 
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The coefficients « and f are therefore equal in absolute value and of opposite sign. 
With (C-50) taken into account, this fixes them to within a phase factor: 


E E (C-52) 


V2 
where y is any real number. We shall choose y = 0, which yields: 
2 
J/2 
Thus we have calculated the four vectors | S, M > without explicitly having 
had to write the matrix which represents S? in the { | ¢,, e, > } basis. 


|0,0> =—= [| => |4] (C-53) 


b. GENERAL CASE (ARBITRARY j, AND j) 


We showed in $C-3-c that the decomposition of &(/;, ja) into a direct sum 
of invariant subspaces & (J) is : 


Elo ja) = EU + fx) DE + dp 00.0 él — Jal) (C-54) 
We shall now see how to determine the vectors | J, M > which span these subspaces. 


a.  Thesubspace 6(J = ji + j2) 


The ket | ji j23 mM, =J,, M =j, > is, in (ji, j,), the only eigenvector 
of J, associated with M = ji + j,. Since J? and J, commute, and the value 
M = j + J, is not degenerate, | ji, ja; mi = j,, Ma =j} must also be an 
eigenvector of J?. According to (C-54), the corresponding value of J can only 
be j, + j,. We can choose the phase of the vector: 


\J=/ + Jz, M =j + J2? 

such that: 
li +jaji +j = | dido de> (C-55) 
Repeated application of the operator J_ on this expression enables us to 


complete the family of vectors | J, M > for which J = j, + j,. Thus, according to 
the general formulas (C-50) of chapter VI: 


J- NA + jadi + jr > = Ay i, + jo) [A +jaji +jz => 1> (C-56) 


We can therefore calculate the vector corresponding to J = ji +j} and 
M =j, +j — | by applying J. = J,_ + J,_ to the vector | ji, j2; po Ja >: 
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Lit +jsji +j — 1) === Y |j +i 1 + > 
n/ Aj, + j,) 


1 
h/ Ai, +j) 
_ 1 
DET 
+hv/ 2, AO = 1 >] (C-57) 


(J,- +3.) dde > 


AVZ lojiji — 1,52 > 


that is: 


lji tind) +j=1>)= %7 L [Ad = j> 
k 
| F 


+ ijeh =i C-58 
va f lidad >» (C-58) 


Indeed, note that we obtain in this way a linear combination of the two basis vectors 
which correspond to M = ji +j — l, and that this combination is directly 
normalized. 

We then repeat the procedure: we construct | j + j2, j +j — 2 > by 
letting J_ act on both sides of (C-58) (for the right-hand side, we take this operator 
in the form J,- + J,_), and so on, through | j, + j2, — (Ji +./2) >, which is found 
to be equal to | fy, j23 — jio — Ja >> 

We therefore know how to calculate the first [2(7, + /,) + 1] vectors of 
the {|/, M>} basis, which correspond to J = j, +j} and M =j, + J, 
J, +d. — l. — (Y, +.) and span the subspace &(/ = ji + j,) of Elija) 


B. The other subspaces & (J) 


Now consider the space S (ji + /,), the supplement of 6(/, +/,) in 6(j,./,). 
According to (C-54), F (jJi + j,) can be broken down into: 


Sli th) = EU + MOS +h 20.06 (li — jo) (C-59) 


We can therefore apply to it the same reasoning as was used in $0. 

In S(j, +. j2), the degree of degeneracy yj, ¡,(M) of a given value of M is 
smaller by one than g,, ¡,(M), since 6 (7, + jz) possesses one and only one vector 
associated with this value of M: 


(M) = g; AM) — 1 (C-60) 


This means, in particular, that M = ji +j, no longer exists in F(j, + j2) 
and that the new maximum value M = ji + j, — l is not degenerate. From 
this we see, as in $a, that the corresponding vector must be proportional to 
|J=j +j- 1l, M=j +7, — 1>. It is easy to find its expansion on the 
{ | jo Jz; Mi, M, > } basis, since, because of the value of M, it is surely of the 
form: 


E 
Ij J2 


lji thi-lLAth-bD = a | jida Ji d2 = 1> 
+ B de == l,j) (C-61) 
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with : 
la|? + BP = 1 (C-62) 


to insure its normalization. It must also be orthogonal to | ji + dad Eh 1> 
which belongs to &(j, + j,) and for which the expression is given by (C-58). The 
coefficients « and $ must therefore satisfy: 


| J d 
a | —— + p =0 (C-63) 
VÍ th Ea +) 


Relations (C-62) and (C-63) determine a and f£ to within a phase factor. We shall 
choose a and $ to be real and, for example, « positive. With these conventions: 


lJejzijej2 -1> 


| 


yii E7 


j tj- Lj +j-1)= 
lja 2 1 2 Vii = 


lJojzji — Li > (C-64) 


This vector is the first of a new family, characterized by J = j + j, — l. 
As in $ a, we can derive the others by applying J_ as many times as necessary. 
Thus we obtain [2(j, + j, — 1) + 1] vectors | J, M > corresponding to 


J=j,+j,-land M =j +j- l, ji tji- 2- — (i +j- 1} 
and spanning the subspace $ (J = j, + Jj, — 1). 


Now consider the space S (ji + j2,j, + j2 — 1), the supplement of the direct 
sum &(j, + j2) 8 EU + J, — 1) in &(/,,j2)* 


PU + jad, tj - 1) = élj +j,-2)0..0 él, — jal) (C-65) 


In S (ji +, +j — 1), the degeneracy of each value of M is again decreased 
by one with respect to what it was in S (y, + /,). In particular, the maximum value 
is now M = ji +j, — 2, and it is not degenerate. The corresponding vector of 
SF (jy + do ji +j — 1) must therefore be | J = ji +h — 2,M =j, +j, -—2>. 
To calculate it in the { | ji, j,; m,, mz > } basis, it is sufficient to note that it is 
a linear combination of the three vectors | j,,j23/:.J. — 2> lo 42541 — Li, l>, 
jo J23j1 — 2, j2 X. The coefficients of this combination are fixed to within a phase 
factor by the triple condition that it be normalized and orthogonal to 
lji t+ dod, + jo — 2>and| 7, +j — Ij, +j} — 2 > (which are already known). 
Finally, the use of J_ enables us to find the other vectors of this third family, thus 
defining &(/, + ja — 2). 

The procedure can be repeated without difficulty until we have exhausted all 
values of M greater than or equal to |j; — j,| [and, consequently, according 
o (C-31), also all those less than or equal to — |j; — j,|]. We then know all the 
desired | J, M > vectors. This method will be illustrated by two"examples in com- 
plement Ay. 


* Of course, S(j, + jz, Ji + j2 — 1) exists only if j, + j} — 2 is not less than |j, — Jl. 
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c. CLEBSCH-GORDAN COEFFICIENTS 


In each space é (ji, j2), the eigenvectors of J? and J, are linear combinations 
of vectors of the initial { | ji, j2; mi, m, > } basis: 


pM 2 $ lija mi, m, > < jo j23m,,m,| J, M > (C-66) 


=-ji m2=-j 


The coefficients < j,, j,; m,, m, | J, M ) of these expansions are called Clebsch- 
Gordan coefficients. 


COMMENT: 


To be completely rigorous, we should write the vectors | ,,j,;m,,m, > 
and |J, M > as |k,, ka; jis j2; My, M, > and [k,, Ko; jy, jas J, MY 
respectively [the values of k, and k,, like those of j; and j}, would then be 
the same on both sides of relations (C-66)|. However, we shall not write k, 
and k, in the symbols which represent the Clebsch-Gordan coefficients, since 
we know that these coefficients are independent of k, and k, (§C-2-c). 


It is not possible to give a general expression for the Clebsch-Gordan 
coefficients, but the method presented in §C-4-b enables us to calculate them by 
iteration for any values of j; and j,. For practical applications, there are numerical 
tables of Clebsch-Gordan coefficients. 

Actually, to determine the Clebsch-Gordan coefficients uniquely, a certain 
number of phase conventions must be chosen. [We mentioned this fact when we 
wrote expressions (C-55) and (C-64)]. Clebsch-Gordan coefficients are always chosen 
to be real. The choice then bears on the signs of some of them (obviously, the 
relative signs of the coefficients appearing in the expansion of the same vector | J, M > 
are fixed; only the global sign of the expansion can be chosen arbitrarily). 

The results of §C-4-b imply that < j,, j2; mi, m, | J, M > is different from 
zero only if : 


M=m, +m, (C-67-a) 
li -Al SISA +h (C-67-b) 


where J is of the same type (integral or half-integral) as j, + ja and |j — jo]. 
Condition (C-67-b) is often called the “triangle rule”: one must be able to form 
a triangle with three line segments of lengths j,, j, and J. 

Since the vectors | J, M > also form an orthonormal basis of the space 
&(j,,/,), the expressions which are the inverse of (C-66) can be written: 


Artie 


| Ji j2; Mi, m, > = y y [MIX M| jo ja;mMm m, > (C-68) 


J= jj M=-3 


Since the Clebsch-Gordan coefficients have all been chosen to be real, the scalar 
products appearing in (C-68) are such that: 


<J, M | dys j2; m, m, > = < hp j2; Mm, m | J, M > (C-69) 
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The Clebsch-Gordan coefficients therefore enable us to express the vectors of the 
old basis { | j,,j,;m,, m, > }, in terms of those of the new basis { | J, M > }. 

The Clebsch-Gordan coefficients possess interesting properties, some of which 
will be studied in complement B,. 


References and suggestions for further reading: 


Messiah (1.17), chap. XIH, §V; Rose (2.19), chap. HI. Edmonds (2.21), chaps. 3 
and 6, 

Relation with group theory : Meijer and Bauer (2.18). chap. 5. $5 and App. III 
of that chapter; Bacry (10.31). chap. 6; Wigner (2.23). chaps. 14 and 15. 

Vectorial spherical harmonics : Edmonds (2.21), $5-10; Jackson (7.5), chap. 16; 
Berestetskii et al. (2.8), §§6 and 7; Akhiezer and Berestetskii (2.14), $4. 
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COMPLEMENTS OF CHAPTER X 


Ax: EXAMPLES OF ADDITION 
OF ANGULAR MOMENTA 


Bx: CLEBSCH-GORDAN COEFFICIENTS 
C,: ADDITION OF SPHERICAL HARMONICS 


Ax: illustrates the results of chapter X by the 
simplest cases not treated in detail in this chapter : 
two angular momenta equal to 1. and an integral 
angular momentum / with a spin 3. Easy, 
recommended as an exercise illustrating methods 


of addition of angular momenta. 


By. Cy: technical complements intended to 
demonstrate certain useful mathematical results ; 
can be used as references, 

B,: study of Clebsch-Gordan coefficients, 
which frequently appear in physical problems 
involving angular momentum and rotational 
invariance, 

Cy: proof of an expression concerning the 
product of spherical harmonics ; useful for certain 
subsequent complements and exercises. 


Dx: VECTOR OPERATORS : 
THE WIGNER-ECKART THEOREM 


Ex: ELECTRIC MULTIPOLE MOMENTS 


F: EVOLUTION 
OF TWO ANGULAR MOMENTA J, AND J: 
COUPLED BY AN INTERACTION aJ, . J, 


Dx. Ex: introduction of physical concepts 
(vector observables, multipole moments) which 
play important roles in numerous fields. 

D, : study of vector operators: proof of the 
Wigner-Eckart theorem, which establishes pro- 
portionality rules between the matrix elements of 
these operators. Rather theoretical, but recom- 
mended for its numerous applications. Can be 
helpful in an atomic physics course (the vector 
model, calculation of Landé factors, etc.). 

E, : definition and properties of electric mul- 
tipole moments of a classical or quantum mecha- 
nical system: study of their selection rules (these 
multipole moments are frequently used in atomic 
and nuclear physics). Moderately difficult. 


Fy : can be considered to be a worked exercise, 
treating a problem fundamental to the vector 
model of the atom: the time evolution of two 
angular momenta J, and J, coupled by an 
interaction W = aJ, . J}. This dynamical point 
of view completes, as it were, the results of 
chapter X concerning the eigenstates of W. 
Fairly simple. 


(continued on the next page) 
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Gx: EXERCISES Gx: exercises 7 to 10 are more difficult than 
the others. Exercices 7, 8, 9 are extensions of 
complements D, and F, (concept of a standard 
component and that of an irreducible tensor 
operator, the Wigner-Eckart theorem). Exer- 
cise 10 takes up the problem of the various ways 
of coupling three angular momenta. 
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Complement Ax 
EXAMPLES OF ADDITION OF ANGULAR MOMENTA 


1. Addition of j, = l and j, = 1 
a. The subspace € (J = 2) 
b. The subspace € (J = 1) 
c. The vector| J = 0,M =0> 


2. Addition of an integral orbital angular momentum / and a spin 1/2 


a. The subspace 8 (J = 1 + 1/2) 
b. The subspace &(J = 1 — 1/2) 


To illustrate the general method of addition of angular momenta described in 
chapter X, we shall apply it here to two examples. 


1. Addition of ji = 1 andj, = 1 


First consider the case in which j} = j, = 1. This is the case, for example, 
for a two-particle system in which both orbital angular momenta are equal to 1. 
Since each of the two particles is then in a p state, this is said to be a “p? configu- 
ration”. 

The space &(1, 1) with which we are concerned has 3 x 3 = 9 dimensions. 
We assume the basis composed of common eigenstates of J7, J3, J,, and J,, to be 
known : 


1,1;m,,m, >}, with m,,m, = 1,0, — 1 l 
1 2 1 2 


and we want to determine the { | J, M > } basis of common eigenvectors of Jĝ, 
33, J? and J,, where J is the total angular momentum. 

According to §C-3 of chapter X, the possible values of the quantum 
number J are: 


J=2,1,0. (2) 


We must therefore construct three families of vectors | J, M >, containing, respec- 
tively, five, three and one vectors of the new basis. 
a. THE SUBSPACE £(J = 2) 


The ket | J = 2, M = 2 > can be written simply: 
[228 = LS (3) 
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Applying J_ to it, we find the vector | J = 2, M = 1): 


ye ta (es 


Mit LL TS 


gi- 


Ed 
2h 


[n/2]1,1;0,1) +AV2|1,1;1,0>] 


=[|1,151,0> +|1,150,1)] (4) 


va 


We use J_ again to calculate | J = 2, M = 0 >. After a simple calculation, we 
find: 


as 


pa a 1> +2]1,1;0,0> +]1,1;- 1,1] (5) 
then: 

21) = EE) -1> +11; = 1,0] (6) 
and, finally : 

|2,- 2) = | 1,1; = L,- 15 (7) 


b. THE SUBSPACE (J = 1) 


We shall now proceed to the subspace &(J = 1). The vector | J = 1, M = 1) 
must be a linear combination of the two basis kets | 1, 1; 1, 0 > and | 1,1; 0, 1 > 
(the only ones for which M = 1): 

LD =a]1,1;1,0> +B] 1,1;0,1> (8) 
with: 

la? + |B)? = 1 (9) 


For it to be orthogonal to the vector | 2, 1 >, it is necessary [cf. (4)] that: 
a+p=0 (10) 


We choose a and $ to be real, and choose, by convention, a positive*. Under these 
conditions: 


111) =e [} 4, 131,0) —]4,1:0,1] (11) 
2 


F 


* In general, the component of the ket | J, J > on the ket | j j2; m, =J/,, m = J —J, > is 
always chosen to be real and positive (cf. complement By, $ 2). 
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Application of J_ here again enables us to deduce | 1,0 > and | 1, — 1 >. We easily 
find, using the same technique as above: 


Le ies SM (12) 
52 


1 
SA ee a (13) 
2 
It is interesting to note that expansion (12) does not contain the vector | 1, 1; 0,0 >, 


although it also corresponds to M = Q. It so happens that the corresponding Clebsch- 
Gordan coefficient is zero : 


(1,1;0,0]1,0>=0 (14) 


c. THE VECTOR |J=0M=0> 


We are left with the calculation of the last vector of the { | J, M > } basis, 
associated with J = M = 0. This vector is a linear combination of the three basis 
kets for which M = 0: 


|0,0> =a|1,1;1,— 1> +6]|1,1;0,0> +c]1,1;-1,1> (15) 
with : 

la? + |b)? + lel? =1 (16) 
It must also be orthogonal to |2,0 > [formula (5)] and | 1, 0 > [formula (12)]. 
This gives the two conditions: 

a+2b+c=0 (17-a) 

a=c=0 (17-b) 
These relations imply: 

a=-b=c (18) 


We again choose a, bh and c real, and agree to choose a positive (see note, p. 1028). 
We then obtain, using (16) and (18): 


}0,0> =-[]1,131,-1>—]1,1:0,0) +|1,1; = 1,1>] (19) 
V3 


This completes the construction of the {|J, M> } basis for the case 
Jy Hh, = 1. 
COMMENT: 


If the physical problem under study is that of a p? configuration of a 
two-particle system, the wave functions which represent the states of the 
initial basis are of the form: 


(FP | 1, 1;m,,m,> = Re, 1 (11) Re, 1 (r2) Yr"(0,, G1) YTO, p,) (20) 
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where r, (ri, 0,,(,) and r,(r,, 0,, p,) give the positions of the two particles. 
Since the radial functions are independent of the quantum numbers m, 
and m,, the linear combinations which give the wave functions associated 
with the kets | J, M > are functions only of the angular dependence. For 
example, in the { | r,, r, > } representation, equation (19),can be written: 


1 = 
[ri t3 | 0,0 > = Ry, (11) Ry, (12) [Y (0, pr) Y; (0), 6) 


J3 


= Y 1(0,, (1) Y:(0,, 2) + Y, (0, pı) Y (0), P) (21) 


2. Addition of an integral orbital angular momentum / 
and a spin 1/2 


Now consider the addition of an orbital angular momentum (j, = /,an integer) 
and a spin 1/2 (j, = 1/2). This problem is encountered, for example, whenever one 
wants to study the total angular momentum of a spin 1/2 particle such as the 
electron. 

The space &(/, 1/2) which we are considering here is 2(2/ + 1)-dimensional. 
We already know a basis of this space* : 


{|L1/2;me } with m=11-—-1,.., — land € = + (22) 
formed of eigenstates of the observables L?,S*, L, and S,, where L and S are the 
orbital angular momentum and spin under consideration. We want to construct the 


eigenvectors | J, M > of J? and J,, where J is the total angular momentum of the 
system: 


J=L+S (23) 


First of all, note that if / is zero, the solution to the problem is obvious. 
It is easy to show in this case that the vectors | 0, 1/2; 0, e > are also eigenvectors 
of J? and J, with eigenvalues such as J = 1/2 and M = e/2. On the other hand, 
if / is not zero, there are two possible values of J: 


1 1 
=] 4^ agri 
J=l zo> l > (24) 
a. THE SUBSPACE (J = I + 1/2) 


The (2/ + 2) vectors | J, M > spanning the subspace &(J = / + 1/2) can be 
obtained by using the general method of chapter X. We have, first of all: 


1 1 
JI +5,1 45> = [1556 +> (25) 


* If we wanted to conform strictly to the notation of chapter X, we should have to write + 1/2, 
and not e, in the basis kets. But we agreed in chapters IV and IX to denote the eigenvectors of S, in the 
spin state space by | + > and | — >. 
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Through the action of J_ (*), we obtain +51 -4): 
1 1 1 1 1 
Ji +5,1 -+> = — J]. |l +5,1 +5) 
2O 2° AA+ ? 2 
= — (1-48) L;i +>) 
naa 
eee. 1-1, +) +h ihe 
nes 2 
21 1 1 1 
Jr í Va +1 2 


We apply J_ again. An analogous calculation yields: 


rr vi] 
ET 


1 
zl 2, +> 


p= 4, = (27) 


More generally, the vector | / + 1/2, M > will be a linear combination of the 
only two basis vectors associated with M: | l, 1/2; M—1/2,+ > and 
(Mis, of course, half-integral). Comparing (25), (26) and (27), 
we can guess that this linear combination should be the following one : 


1 1 1 
zm) | ji +m +5 Ll. M — =, +> 
2 V2 +1 Lv Bee 2 
+ E ->| 23) 
Vy Za 2 
with: 
1 1 3 1 1 
M=1+5 => BF at (PD) (29) 


* To find the numerical coefficients appearing in the following equations, we can simply use the 
relation: jj + 1) — m(m — 1) = (+ mu — m + 1). 
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Reasoning by recurrence, we can show this to be true, since application of J_ to 
both sides of (28) yields: 


Ji +5,M — 1) = si +3, M) 
1 3 
h(i +m +3) M +5 
1 1 
pew roma e? 
y 2 2 
1. | 1 3 1 3 
TáamMmstbon aw eae = wt | ew =. +) 
y a 2 2 2 2 
| 1 1 1 
+ j1+M ten 
j 2 2 
\ 
/ 1 1 1 
V 2N 
1 1 1 3 
|I +M- e > +> 
~ Jat ily 


i 3 1 1 
Da Os +5 ls M 3 >| (30) 


We indeed obtain the same expression as in (28), with M changed to M — 1. 


b. THE SUBSPACE ¿(Y = I — 1/2) 


We shall now try to determine the expression for the 2/ vectors | J, M > 
associated with J = / — 1/2. The one which corresponds to the maximum value 
l — 1/2 of M is a normalized linear combination of |/,1/2;/— 1, + > and 
, — >, and it must be orthogonal to | / + 1/2, / — 1/2 > [formula (26)]. 
real and positive (cf. note p. 1028), we 


easily find: 


1 
Lash — > a 


IM 
ee +> (31) 


The operator J_ enables us to deduce successively all the other vectors of the 
family characterized by J = / — 1/2. Since there are only two basis vectors with 
a given value of M, and since | / — 1/2, M > is orthogonal to | / + 1/2, M >, (28) leads 
us to expect that: 


J1-5,M) = =| +M +3 ama a 
= 2 2 
we 
- fina ed jag +>] 63 
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for: 


1 3 3 1 
5 > rey TEES ae (33) 


By an argument analogous to the one in §2-a, this formula can also be proved by 
recurrence. 


COMMENTS: 


(i) The states | /, 1/2; m, € > of a spin 1/2 particle can be represented by two- 
component spinors of the form: 


[itm ele) = Ras) YTO 0) (4) (34-a) 
[Vit :m -1E = Ri,dr) Y7O, 9) (") (34-b) 


The preceding calculations then show that the spinors associated with the 
states | J, M > can be written: 


E 
Mi +M +> Y 20, 9) 
i | 
[Yim] = TT Ban reer ae 
J! - M +5 YI, 9) (35-a) 


I 1 1 
- jl- M +5 YI" 20, 9) 
1 Ny 
A) A i 
an ME Y"+3(0, p) (35-b) 


(ii) In the particular case / = 1, formulas (25), (28), (31) and (32) yield : 


35> =[ 1351 +> 

E 3) = Al 1,350, +> + 1,351 = 

E PI + £ 1,550, —> 

= EL a 
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and: 
: 
liia Bigla esah: 
Barm eL DCI 
1 1 1 1 24, 1. 
A PO a (36-b) 


References and suggestions for further reading: 


Addition of an angular momentum / and an angular momentum S = 1 : see 
“vectorial spherical harmonics” in the references of chapter X. 
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Complement Bx 
CLEBSCH-GORDAN COEFFICIENTS 


1. General properties of Clebsch-Gordan coefficients 
a. Selection rules 
b. Orthogonality relations 
c. Recurrence relations 


2. Phase conventions. Reality of Clebsch-Gordan coefficients 
a. The coefficients < ji, J2; m,, mz | J, J >; phase of the ket | J, JY 
b. Other Clebsch-Gordan coefficients 

3. Some useful relations 


a. The signs of some coefficients 

b. Changing the order of j, and j, 

c. Changing the sign of M, m, and m, 
d. The coefficients < j, j; m, -— m| 0,0» 


Clebsch-Gordan coefficients were introduced in chapter X [cf. relation (C-66)] : 
they are the coefficients < j,, j,; m,, m, | J, M > involved in the expansion of the 
ket | J, M > on the { | ji. j2; mj» m, > } basis: 


J2 
|J,M > = 2 2 <j ins my, m2 |J, M >| ji jm, m, > (1) 


-ji m=-j2 


In this complement, we shall derive some interesting properties of Clebsch- 
Gordan coefficients, some of which were simply stated in chapter X. 

Note that, to define the < ji, j2; m,, m, | J, M > completely, equation (1) 
is not sufficient. The normalized vector | J, M > is fixed only to within a phase factor 
by the corresponding eigenvalues J(J + 1)h? and Mh, and a phase convention 
must be chosen in order to complete the definition. In chapter X, we used the 
action of the J_ and J, operators to fix the relative phase of the (2J + 1)kets | J, M > 
associated with the same value of J. In this complement, we shall complete this 
choice of phase by adopting a convention for the phase of the kets | J, J >. This will 
enable us to show that all the Clebsch-Gordan coefficients are then real. 

However, before approaching, in § 2, the problem of the choice of the phase 
of the < ji, jz; Mı, M, |J, M X, we shall, in $1, study some of their most useful 
properties which do not depend on this phase convention. Finally, §3 presents 
various relations which will be of use in other complements. 


1. General properties of Clebsch-Gordan coefficients 


a. SELECTION RULES 


Two important selection rules, which follow directly from the results of 
chapter X concerning the addition of angular momenta, have already been given 
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in this chapter [cf. relations (C-67-a) and (C-67-b)]. We shall simply restate them 
here : the Clebsch-Gordan coefficient < j,, j,3 Mj, M, | J, M > is necessarily zero 
if the following two conditions are not simultaneously satisfied : 


M = my, + mM, (2) 
lji — jl <<], tj (3-a) 


Inequality (3-a) is often called the “triangle selection rule”, since it means that 
a triangle can be formed with three line segments of lengths j; , j, and J (cf. fig. 1). 
These three numbers therefore play symmetrical roles here, and (3-a) can also be 
written in the form: 


-il Sh SI +), (3-b) 
or: 
Y =p] Si <I + (3-c) 
J 
h FIGURE | 


Triangle selection rule : the coefficient < ¿,,/,3m,.m,1J, M> 
can be different from zero only if it is possible to form 
jy a triangle with three line segments of lengths /,. 7,. J. 


Moreover, the general properties of angular momentum require that the 
ket | J, M > and, therefore, the coefficient < ji, /,; m,, Ma |J, M >, exist only 
if M takes on one of the values: 


Med ol = 1 yl Sao gous SS (4-a) 
Similarly, it is necessary that: 

m =Jpojy =l,- — ji (4-b) 

Mm = jaji — l,- — he (4-c) 
If this is not the case, the Clebsch-Gordan coefficients are not defined. However, 
in what follows, it will be convenient to assume that they exist for all m,,m, and M, 
but that they are zero if at least one of conditions (4) is not satisfied. These 
relations thus play the role of new selection rules for the Clebsch-Gordan 


coefficients. 
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ORTHOGONALITY-RELATIONS 


Inserting the closure relation* : 


> > ljo j2; m, m, > < jija: my, m) | = 1 (5) 


m=-ji m=-jz2 


in the orthogonality relation of the kets | J, M >: 


<J, M| J', M' = 653: Ômm' (6) 


we obtain: 


ji j2 
$ . D l <J, M | jis j2; Mi, m > jis j2; Mi, Mz | J’, M' > = òy Óum 
m3 =-7Jm=-7J2 


(7-a) 


We shall see later [cf. relation (18-b)] that the Clebsch-Gordan coefficients are 
real, which enables us to write this relation in the form: 


ji j2 
X l y < jis J2: MM) | J,M > <j jz: m, m | J’, M’ > = 65) mm 
mi=— jit m2= —j2 
(7-b) 


Thus we obtain a first “orthogonality relation” for the Clebsch-Gordan coefficients. 
We note, moreover, that the summation which appears in it is performed, in fact, 
over only one index : for the coefficients of the left-hand side to be different from 
zero, m, and m, must be related by (2). 


Similarly, we insert the closure relation: 


j +j2 J 
Y |J,M><I,M|=1 (8) 
J 


J=li=j]M=- 


in the orthogonality relation of the kets | j,, j2; Mi, M, >; we obtain: 


+ J 
y ¿iva mi, m, | J, M > < J, M | Fis j2; mi, m, » = Ô mimi ecg 
J=lji—=j] M=-J 

(9-a) 


that is, with (18-b) taken into account: 


tj 


J 
y ji j2; Mi, M3 | J, M >< Jj mi, m | J, M > = Omimiómam) 
J 


J=|ji- j| M=- 
(9-b) 


Again, the summation is performed over only one index: since we must have 
M = m, + m,, the summation over M reduces to a single term. 


* This closure relation is valid for a given subspace &(k,, kz: ji» j2) (cf. chap. X, §C-2). 
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c. RECURRENCE RELATIONS 


In this section, we shall use the fact that the kets |j}, j2; m,, Mm, > form a 
standard basis. Thus: 


Jiz |j j2; m m, > = h/ iis +1) — m(m, + 1) | dy. jz; m + 1,m, > 


Joe heres m,m: > =hv/ jj, +1) — m,(m, + 1) | Ji j2; m1, m3 al 


(10) 
Similarly, by construction, the kets | J, M > satisfy: 


Je |QM>=hV JJ +1) - MM + D]J,M21> (11) 


We shall therefore apply the J_ operator to relation (1). Since J_ =J,_ + J,_, 
we obtain (if M > — J): 


VNS +1) — M(M —1)|J,M—1)>= 


ji j2 
es Y <div ios mm, | J, M > 


my = — ji m2= — j2 


x [Vj + 1) — mlm, — 2) [Ja insm, — 1, m, > 
. . 1 , i m . , m 
+v jaja +1) — mm — 1) | Ji j2; M m, — 1>] (12) 


Multiplying this relation by the bra < ji, j2; ™,, m, |, we find: 


VJJ +1) — M(M — 1) <jajaim, m |J, M — 1> 
= Vili: + 1) — mim, +1) <jij23m, + 1m, | J,M > 
+ V7 jaja +1) — mam, +1) <j j.3my,m, +1]1J,M> (13) 
If the value of M is equal to — J, we have J_ | J, — J > = 0, and relation (13) 
remains valid if we use the convention, given above in §1-b, according to which 
< jis j2; Mı, m |J, M > is zero if |M| > J. 


Analogously, application of the operator J} = Ji} + J,4 to relation (1) 
leads to: 


= Vili + 1) — m(m, — 1) <jy.j.3m, — 1,m, | J,M > 
+ Viaja +1)-— m(m, — 1) < jis j2; Mı, m, — 1 | J, M > (14) 


(the left-hand side of this relation is zero if M = J); (13) and (14) are recurrence 
relations for the Clebsch-Gordan coefficients. 
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2. Phase conventions. Reality of Clebsch-Gordan coefficients 


As we have seen, expressions (12) fix the relative phases of the kets | J, M > 
associated with the same value of J. To complete the definition of the Clebsch- 
Gordan coefficients involved in (1), we must choose the phase of the various 
kets | J, J >. To this end, we shall begin by studying some properties of the 
coefficients < j}, jz; Mp ma |J, J >. 


a. THE COEFFICIENTS <j j: m, m>|J, J): PHASE OF THE KET |J, J? 


In the coefficient < j,, j2; m,,m, | J, J >, the maximum value of m, is m, = j,. According 
to selection rule (2), m, is then equal to J — j, [whose modulus is well below j,, according 
to (3-b)]. As m, decreases from this maximum value j,, one unit at a time, m, increases until it 
reaches its maximum value m, =j, [m, is then equal to J —j,, whose modulus is 
well below j,, according to (3-c)]. In theory, therefore, (j, + ja — J + 1) non-zero Clebsch- 
Gordan coefficients < j}, j,; m,, m, | J, J > can exist. We are going to show that, in fact, none 
of them is ever zero. 

If we set M = J in (14), we obtain: 

¡AN Oe on 
jaz + 1) — mom, - 1) 


EN < jija; mm =1 J,J> 
Vilh + 1 — mtm - 107 A | 


(15) 
The radical on the right-hand side of this relation is never zero, nor is it infinite, so long as the 
Clebsch-Gordan coefficients appearing there satisfy rules (4-b) and (4-c). Relation (15) therefore 
shows that if < j,, j23J,, J — ji | J, J > were equal to zero, < fy, otf ~ 1 SI HH 113,45 
would be zero as well, as would be all the succeeding coefficients < j, , j2 ;m,, J — m, |J, J >. Now, 


this is impossible, since the ket | J, J >, which is normalized, cannot be zero. Therefore, all the 
coefficients < j}, ja; m,, J — m, | J, J > (with ji > m, > J — j,) are different from zero. 


<j j23}m, — Im, IJ, JY =a 


In particular, the coefficient < ji, j2; jp J — Jj, | J, J >, in which m, takes on its 
maximum value, is not zero. To fix the phase of the ket | J, J >, we shall require 
this coefficient to satisfy the condition : 


Cdpdeidi JJ 14,3) real and positive (16) 
Relation (15) then implies by recurrence that all the coefficients 

(Jia Mm) —m, PES 
are real [their sign being (— !)'~™ ]. 


COMMENT: 


The phase convention we have chosen for the ket | J, J > gives the two 
angular momenta J, and J, asymmetrical roles. It actually depends on the 
order in which the quantum numbers j} and j, are arranged in the Clebsch- 
Gordan coefficients : if j, and j, are permuted, the phase of the ket | J, J > 
is fixed by the condition : 


Ciojiijad—j1J, JY real and positive (17) 


which is not necessarily equivalent, a priori, to (16) [(16) and (17) may define 
different phases for the ket | J, J >]. We shall return to this point in $3-b. 
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b. OTHER CLEBSCH-GORDAN COEFFICIENTS 


Relation (13) enables us to express, in terms of the < j}, j2; Mmi, Mma |J, J >, 
all the coefficients < ji, jz; Mi, M, |J, J — 1), then all the coefficients 
< ji j2; my, mM, | J, J — 2), etc. This relation, in which no imaginary numbers 
are involved, requires that all Clebsch-Gordan coefficients be real: 

< Jis J2} My, Mz | J, M >* = < ji J2} My, Ma |J, M> (18-a) 
which can also be written: 

< jo jai my, m, |J, M ò = (I, M | fy, j2; m» m Y (18-b) 


However, the signs of the < j4, j2; m,, m, | J, M > do not obey any simple 
rule for M # J. 


3. Some useful relations 


In this section, we give some useful relations, which complement those given in §1. 
To prove them, we shall begin by studying the signs of a certain number of Clebsch-Gordan 
coefficients. 


a. THE SIGNS OF SOME COEFFICIENTS 


A. The coefficients < jy, j2; My Mal ji + ja MY 


Convention (16) requires the coefficient < ji, J2; jis 42 | Ji + Jz Ji + j > to be real and 
positive; it is, moreover, equal to 1 (ef. chap. X, $ C-4-b-a). Setting M = J = ji + j in (13), 
we then see that the coefficients < ji, j2; My, m2 | ji +2 ji + 2 — 1) are positive. By 
recurrence, it is then easy to prove that : 


< joj: Mms m lj +j M> 2 0 (19) 


B. Coefficients in which m, has its maximum value 


Consider the coefficient < j,, jz; m,, M, | J, M >. In theory, the maximum value of m, 
is m, = jı. However, we then have m, = M — j,, which, according to (4-c), is possible only 
if M —j, > —j,, that is: 


M2 hi Jj (20) 
If, on the other hand : 


M Sj, + (21) 


the maximum value of m, corresponds to the minimum value of m, (m, = — /,), and is therefore 
equal tom, = M + jz. 

Let us show that all Clebsch-Gordan coefficients for which m, has its maximum value 
are non-zero and positive. To do so, we set m, = j; in (13); we find: 


VIJ +1) — MM — 1) iy, inte m2 |J, M — 19 


= Vidi +1) — mm, + 1D)< jo jj Mm, +1 | J,M> (22) 
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Using this relation, an argument by recurrence starting with (16) shows that all the coefficients 
< jijiji, M — J, |J, M > are positive [and non-zero if M satisfies (20)]. Analogously, setting 
m, = — j, in (14), we could prove that all the coefficients < j,, j23 M + ja, — ja |J, M > are 
positive [if M satisfies (21)]. 


Y. The coefficients < ji, j2: my, mz | J, J > and < jija; m, m | J, -J> 
We saw in $ 2-a that the sign of < ji, J2; Mi, m |J, J> is (— 1)! ™, In particular: 
the sign of < ji j2} J — ja j2 |J, J> =(= 1) tes (23) 


To determine the sign of < ji, j2; mi. M, | J, — J >, we can set M = — J in (13), whose 
left-hand side then goes to zero. We therefore see that the sign of < j}, ja; m,, ma |J, ~ J> 
changes whenever m, (or m,) varies by + 1. Since, according to § B, < j,,/.3/. — J. —j2 1d, ~J Y 
is positive, it follows that the sign of < j1, J2; m, m, | J, — J >is (— 1)"2*2, and, in particular: 


the sign of < j jz: — jr -J +h | J, —~J>=(- [ites (24) 


b. CHANGING THE ORDER OF j, AND j; 


With the conventions we have chosen, the phase of the ket | J, J > depends on the order 
in which the two angular momenta j; and j, are arranged in the Clebsch-Gordan coefficients 
(cf. comment of $ 2-a). If they are taken in the order j,, j}, the component of | J, J > along 
li. /2ij:.4 —J1> is positive, which means that the sign of the component along 
lite doi: J — Jar ja > is (— 1) *27Y, as is indicated by (23). On the other hand, if we pick the 
order /,, ji, relation (17) shows that the latter component is positive. Therefore, if we invert j; 
and j,, the ket | J, J > is multiplied by (— 1)'*~/. The same is true for the kets | J, M >, which 
are constructed from | J, J > by the action of J _ in such a way that the order of j, and j, plays no 
role. Finally, the exchange of j; and j, leads to the relation : 


Cia Ji; Ma Mm, | J, M>= (- 1t È jaja; Mi, Mm, | J,M> (25) 


c. CHANGING THE SIGN OFM, m, AND m, 


In chapter X and in this complement, we have constructed all the kets | J, M > (and, 
therefore, the Clebsch-Gordan coefficients) from the kets | J, J >, by applying the operator J_. 
We can take the opposite point of view, and start with the kets | J, — J >, using the operator J ,. 
The reasoning which follows is exactly the same, and we find for the kets | J, — M > the same 
expansion coefficients on the kets | j,, j2; — m,, — m, X as for the | J, M > on the | j, j2; mpm, >. 
The only differences that can appear are related to the phase conventions for the kets | J, M >, 
since the analogue of (16) then requires < j,, jz; —= Ji — YI +ji| J, — J> to be real and 
positive. Now, according to (24), the sign of this coefficient is, in reality, (— 1)1*227%. Conse- 
quently : 


Cies — My. — Mm | J, —-M>= (- PAE Jija Mi, Ma | J, M> (26) 


In particular, if we set m, = m, = 0, we see that the coefficient < j}, ja; 0, 0 | J, 0 > is 
zero when j; + ją — J is an odd number. 


d. THE COEFFICIENTS <j.j/:m. - ml|0,0> 


According to (3-a), J can be zero only if j; and j, are equal. We therefore substitute the 
values ji = j, =j, m =m,m, = —m— land J = M = 0 into (13); we obtain: 


<jjim+ 1l, - (m+ 1)[0,0> = -<j jm, —m|0,0> (27) 
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All the coefficients < j, j; m, — m | 0, O > are therefore equal in modulus. Their signs change 
whenever m varies by one, and, since < j, j; j, — j |0, 0 > is positive, it is given by (— 1)”. 
Taking into account orthogonality relation (7-b), which indicates that : 


J 
Y <jjim —-m|0,0)? =1 (28) 
m=-j 
we find: 
; (— 1)" 
2j +1 
References : 


Messiah (1.17), app. C; Rose (2.19), chap. II and app. I; Edmonds (2.21), chap. 3; 


Sobel’man (11.12), chap. 4, §13. 
Tables of Clebsch-Gordan coefficients : Condon and Shortley (11.13), chap. HI, 


§14; Bacry (10-31), app. C. 
Tables of 3j and 6j coefficients : Edmonds (2.21), Table 2; Rotenberg et al. (10.48). 
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ADDITION OF SPHERICAL HARMONICS 


1. The functions #¥ (Q, ; Q,) 
2. The functions F7 (Q) 
3. Expansion of a product of spherical harmonics ; the integral of a product of three spherical harmonics 


In this complement, we use the properties of Clebsch-Gordan coefficients to prove 
relations that will be of use to us later, especially in complements Ex and Ax: 
the spherical harmonic addition relations. With this aim in mind, we shall begin by 
introducing and studying the functions of two sets of polar angles Q, and Q,, the 
F(Q; 0,). 


1. The functions (0, : Q,) 


Consider two particles (1) and (2), of state spaces £! and 6? and orbital 
angular momenta L, and L,. We choose for the space £} a standard basis, formed 
by the kets { | py, > |, whose wave functions are: 


dismi 


Prnt) = Rea) YE) (1) 


(Q, denotes the set of polar angles { 0,, g, } of the first particle). Similarly, we 

choose for £? a standard basis, { | p,, ,, m, > } In all that follows, we shall confine 

the states of the two particles to the subspaces &(k,, /,) and &(k,, /,), where k,, 4, 

A, and /, are fixed, and the radial functions R, , (r,) and R,, , (r,) play no role. 
The angular momentum of the total system (1) + (2) is: 


J=L, +L, (2) 


According to the results of chapter X, we can construct a basis of &(k,,/,) @ $(k,, L) 
of eigenvectors | $% > common to J? [eigenvalue J(J + 1)h? Jand J, (eigenvalue Mh). 
These vectors are of the form: 


ti lz 
| pY >= y X < h, da; my, m, | J,M > | Pry tym) »® CAM 
my = -li m>,=—l, 
(3-a) 


the inverse change of basis being given by: 


iy tls 


J 
| Pry .tym(1) ? & | Pky to.m2) > = X y < lis l, ; Mi, M3 | J, M > | oy > 
J= {ly -h| M=-J 
(3-b) 
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Relation (3-a) shows that the angular dependence of the states | HF > is described 
by the functions: 


PQ, ;2,) = > > < li 1,5 m, m, | J,M > Yr(2,) Y72(Q,) (4-a) 


Similarly, relation (3-b) implies that: 


ly +12 J 
Yr (Q) Y7? (Q2) = > X < li, 1,5 m, m3 | J,M > CHAOR ; Q2) (4-b) 


J=|l,-t2| M=-J 


To the observables L, and L, correspond, for the wave functions, differential 
operators acting on the variables Q, = (0,, g, } and Q, = [0,, p, }; in parti- 
cular: 


Liz => 100, (5-a) 
h 0 
L», => ido, (5-b) 


Since, by construction, the ket | 9 > is an eigenvector of J, = L,, + L,,, we can 
write : 


h/ ô 0 
(ae, + Jen, 013 0a P2) = Mh DYO P15 O05 02) (6) 


Similarly, we have : 
J) Oy =A IE +1) — MM + 1) | o¥*1 > (7) 


which implies, with formulas (D-6) of chapter VI taken into account: 


: 0 0 
tipi La ; —_ 
fe E 0, + icot 0, A 


+ gel dar + icot | poso, 01502, P2) 


= VHS +1) — M(M + 1) 0%*1(0,,p,:0, 02) (8) 


2. The functions FT(Q) 


We now introduce the function F? defined by: 
FT(0, p) = FI(Q) = Gy "(Q, = Q; Q, = 2) (9) 


FT is a function of a single pair of polar angles Q = { 6, y }, and can therefore 
characterize the angular dependence of a wave function associated with a single 
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particle, of state space &, and angular momentum L. In fact, we shall see that F7 
is not a new function, but is simply proportional to the spherical harmonic Y”. 
To demonstrate this, we shall show that F? is an eigenfunction of L? and L, 
with the eigenvalues /(/ + 1)h? and mh. We therefore begin by calculating the 
action of L, on F7. According to (9), F7 depends on 8 and o by way of 2, = | 0,,9, } 
and Q, = { 0,, @, ), which are both taken equal to 2. If we apply the differentiation 
theorem for functions of functions, we find : 
aq FTO. 9) 


hj] 0 , 8 |5mmo . 
o Ala jes eioh = 


Relation (6) then yields: 


L, Fi (0, p) = mh F7(6, p) (11) 


which proves part of the result being sought. To calculate the action of L? on F”, 
we use the fact that: 


1 


=> (L,L-+ L-L,) +22 (12) 


Now, by using an argument analogous to the one which enabled us to write (10) 
and (11), relation (8) leads to: 


L,F”(0, 0) = RV Ki +1) — mim + 1) F”+'(0, 9) (13) 


With this, (12) then yields: 


L? Fr(0, p) = ro + 1) — mm — 1)] 


+ [Ul + 1) — mm +1)] + 2m? b FP. p) 
= ll + 1h? FR, p) (14) 


F”, which, according to (11), is an eigenfunction of L, with the eigenvalue mh, 
is therefore also an eigenfunction of L? with the eigenvalue /(/ + 1)h?. Since L? 
and L, form a C.S.C.O. in the space of functions of @ and y alone, FT is necessarily 
proportional to the spherical harmonic Y7. Relation (13) enables us to show 
easily that the proportionality coefficient does not depend on m, and we find: 


FT(O, p) = AL) YTO, o) (15) 


We must now calculate this proportionality coefficient 4(/). To do so, we shall 
choose a particular direction in space, the Oz direction (0 = 0, y indeterminate). 
In this direction, all the spherical harmonics Yj" are zero, except those corres- 
ponding to m = 0 [since Y” is proportional to e'””, they must be zero for the value 
of Y” in the Oz direction to be defined uniquely; to see this, set @ = 0 in (66), (67) 
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and (69) of complement A,,]. When m = 0, the spherical harmonic Y"(@ = 0, ọ) 
is given by [cf. complement A,,,, relations (57) and (60)]: 


21 +1 
Y?(0 = 0, 4) = yl Tr (16) 
Substituting these results into (4-a) and (9), we find: 
FIO = 0.9) = <l h:0,0] 0 >V + UER +1) (17) 
Furthermore, according to (15) and (16): 
+ | 
Fr=%(0 = 0, p) = All) ja (18) 
y 4 
We therefore have: 
l (21, + DL + 1) 
= 2 EA . 19 
MI) ere eao (19) 
3. Expansion of a product of spherical harmonics ; 


the integral of a product of three spherical harmonics 


With (9), (15) and (19) taken into account, relations (4-a) and (4-b) imply 


that: 
mor [ (24 + Deh +1) l = 
¥™Q) = | fee ie pnl 30,0] 40> 
x VY <A bs. my |l, m> YQ) Yr(9) (20) 
and: 


Mees ¡el + 1)(2l, + 1) 
l= -h| m=-0N 4r(21 + 1) 


x <la l,:m,,m,|bm> YQ) (21) 


YT (2) Y 710) = <1,,1,50,0]1,0> 


This last relation (in which the summation over m is actually unnecessary, since the 
only non-zero terms necessarily satisfy m =m, + m,) is called the spherical 
harmonic addition relation*. According to formula (26) of complement B,, the 


* In the particular case in which /, = 1, m, = O[Y?(0. p) x cos 0], it yields formula (35) of 
complement Ay. 
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Clebsch-Gordan coefficient </,,1/,30,0/1/,0> is different from zero only if 
I, + 1, — Lis even. The product Y7i'(Q)Y7?(Q) can therefore be expanded only in 
terms of spherical harmonics of orders : 


f=i,+1, .4,+464-2 .44+4,-4 ,... lt, — L) (22) 


In (21), the parity (— 1) of all the terms of the expansion on the right-hand side is 
thus indeed equal to (— 1)''*”, the parity of the product which constitutes the 
left-hand side. 

We can use the spherical harmonic addition relation to calculate the integral: 


[= | y™(Q) ¥™(Q) ¥™(Q) dQ (23) 


Substituting (21) into (23), we find expressions of the type: 


K(l, m;l,, m3) = | Y7(Q) YT (0) dQ (24) 


which, with the spherical harmonic complex conjugation relations and orthogonality 
relations taken into account [cf. complement Ay,, relations (55) and (45)], are 
equal to: 


K(!, m; lz}, m3) = (= 1)" O11, Om. — ms (25) 


The value of / is therefore: 


mi m2 m3 =n m3 (en T 1\(2!, + 1) 
[rr (2) YO) YEN) a2 = (— 1) Wa 


x €1,,1,:0,0]1,,0> <1,,1,5m,.m,| 1, — m, > (26) 


This integral is, consequently, different from zero only if: 

(1) m, + m, + m, = 0, as could have been predicted directly, since the 

2n 
integral over ọ in (23) is | E ee ec 
0 

(ii) a triangle can be formed with three line segments of lengths /,, /, and /3. 

(iii) 1, + 1, — l is even (necessary for < /,, /,; 0, 0|/;. 0 > to be different 
from zero), that is, if the product of the three spherical harmonics Y, Y 7? and YT 
is an even function (obviously a necessary condition for its integral over all directions 
of space to be different from zero). 


Relation (26) expresses, for the particular case of the spherical harmonics, 
a more general theorem, called the Wigner-Eckart theorem. 
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Complement Dx 
VECTOR OPERATORS : THE WIGNER-ECKART THEOREM 


1. Definition of vector operators; examples 
2. The Wigner-Eckart theorem for vector operators 


a. Non-zero matrix elements of Y in a standard basis 
b. Proportionality between the matrix elements of J and Y inside a subspace € (k, j) 
c. Calculation of the proportionality constant; the projection theorem 


3. Application: calculation of the Landé g, factor of an atomic level 


a. Rotational degeneracy; multiplets 
b. Removal of the degeneracy by a magnetic field; energy diagram 


In complement B,, (cf. §5-b), we defined the concept of a scalar operator: it is an 
operator A which commutes with the angular momentum J of the system under 
study. An important property of these operators was then given (cf. §6-c-B of that 
complement): in a standard basis, { | k, j.m >y y, the non-zero matrix elements 
<k,j,m|A|k’',j', m' > of a scalar operator must satisfy the conditions j = j' and 
m = m'; in addition, these elements do not depend on m*, which allows us to write: 


Ck, jm|A|k,j,m>= a(k, KYO 55S pmp: (1) 


In particular, if the values of k and j are fixed, which amounts to considering the 
“restriction” of A (cf. complement B,, $ 3) to the subspace £ (k, j) spanned by the 
(2j + 1) kets |k, j} my (m= -—j, —j + 1,.., +j) we obtain a very simple 
(2j + 1) x (27 + 1) matrix: it is diagonal and all its elements are equal. 

Now consider another scalar operator B. The matrix corresponding to it in 
the subspace &(k, j) possesses the same property: it is proportional to the unit 
matrix. Therefore, the matrix corresponding to B can easily be obtained from the 
one associated with A, by multiplying all the (diagonal) elements by the same 
constant. We therefore see that the restrictions of two scalar operators A and B 
to a subspace &(k, j) are always proportional. Denoting by P(k, j) the projector 
onto the subspace $(k, j}, we can write this result in the form ** : 


Pk, J) BP(k, j) = alk, j) P(k, j) A P(k, j) (2) 


The aim of this complement is to study another type of operator which 
possesses properties analogous to the ones just recalled : the vector operator. We 
shall see that if V and V’ are vectorial, their matrix elements also obey selection rules, 
which we shall establish. Moreover, we shall show that the restrictions of V and V’ 
to &(k, j) are always proportional: 


P(k, j) VW’ P(k, j) = a(k, j) P(k, j) V Pk, j) (3) 


These results constitute the Wigner-Eckart theorem for vector operators. 


* The proof of these properties was outlined in complement By, We shall return to this point 
in this complement ($3-a) when we study the matrix elements of a scalar Hamiltonian. 

** For two given operators 4 and B, the proportionality coefficient generally depends on the 
subspace &(k, j) chosen; this is why we write A(k, j). 


1048 


www.elsolucionario.net 
VECTOR OPERATORS 


COMMENT: 


Actually, the Wigner-Eckart theorem is much more general. For example, 
it enables us to obtain selection rules for the matrix elements of V between 
two kets belonging to two different subspaces @(k, j) and &(k’, j’), or to 
relate these elements to the corresponding elements of V’. The Wigner-Eckart 
theorem can also be applied to a whole class of operators, of which scalars 
and vectors merely represent special cases: the irreducible tensor operators 
(cf. exercise 8 of complement Gy), which we shall not treat here. 


1. Definition of vector operators ; examples 


In §5-c of complement B,,, we showed that an observable V is a vector if its 
three components V, V, and Y in an orthonormal frame Oxyz satisfy the following 
commutation relations: 


[Lo A =0 (4-a) 
Ue Y] = -iy (4-c) 


as well as those obtained by cyclic permutation of the indices x, y and z. 
To give an idea of what this means, we shall give some examples of vector 
operators. 


(i) The angular momentum J is itself a vector; replacing V by J in formulas (4), 
we simply obtain the relations which define an angular momentum (cf. chap. VI). 

(ii) For a spinless particle whose state space is &,, we have J = L. It is then 
simple to show that R and P are vector operators. We have, for example: 


[L., X] = [YP, — ZP,, X] =0 
[Lo Y] = [- ZP, Y] = ihZ (5) 
[L, Z] = [YP,, Z] = — iħY 


(iii) For a particle of spin S, whose state space is 8, @ 6,, J is given by 
J = L + S. In this case, the operators L, S, R, P are vectors. If we take into account 
the fact that all the spin operators (which act only in &,) commute with the orbital 
operators (which act only in &,), the proof of these properties follows immediately 
from (i) and (ii). 

On the other hand, operators of the type L?, L. S, etc., are not vectors, but 
scalars [cf. comment (i) of complement By,, $5-c]. Other vector operators could, 
however, be constructed from those we have mentioned: R x S, (L . SPP, etc. 

(iv) Consider the system (1) + (2), formed by the union of two systems : (1), 
of state space &,, and (2), of state space &,. If V(1) is an operator which acts 
only in &,, and if this operator is a vector [that is, satisfies commutation 
relations (4) with the angular momentum J, of the first system ], then the extension 
of V(1) into &, ® 8, is also a vector. For example, for a two-electron system, the 
operators L,, R,, S,, etc. are vectors. 
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2. The Wigner-Eckart theorem for vector operators 


a. NON-ZERO MATRIX ELEMENTS OF V IN A STANDARD BASIS 


We introduce the operators V,, V_, J, and J_ defined by: 


V, = y, £ iV 
J =J 4 id, (6) 


Using relations (4), we can easily show that: 


[J Vel = F AY, (7-a) 
[J Vi] = — ¡nv (7-b) 
[J V] = AV, (7-c) 


from which we can deduce the commutation relations of J, and V, : 


LJ... y, | =0 (8-a) 
[J.. Y] = 246, (8-b) 
[Y_. V,] = — 2hY, (8-c) 
[J.V ]=0 (8-d) 


Now consider the matrix elements of V in a standard basis. We shall see that 
the fact that V is a vector implies that a large number of them are zero. First of all, 
we shall show that the matrix elements < k, j, m | V, | k', j, m' > are necessarily 
zero whenever m is different from m’. It suffices to note that V, and J, commute 
[which follows, after cyclic permutation of the indices x, y and z, from relation (4-a) ]. 
Therefore, the matrix elements of V, between two vectors | k, j, m > corresponding 
to different eigenvalues mh of J., are zero (cf. chap. II, $D-3-a-f). 

For the matrix elements < k, j, m | V, | k’, J, m' > of V,, we shall show that 
they are different from zero only if m — m' = + |. Equation (7-c) indicates that: 


JV, = VJ, + RV, (9) 
Applying both sides of this relation to the ket | k’, j’, m' >, we obtain: 
JAVI) mo) = Y, LK moy AV, |k jm y 
= (m + I Va| Rim’) (10) 


This relation indicates that V, |k’, j’, m' > is an eigenvector of J, with the eigen- 
value (m' + 1)h*. Since two eigenvectors of the Hermitian operator J, associated 


* It should not be concluded that V, |k.j,m > is necessarily proportional to |k, j} m + 1). 
In fact, the argument we have given shows only that: 


Vilkjm>= YY ce | kim £1). 
ky 

For us to be able to omit, for example, the summation over j’, it would be necessary for V, to commute 

with J*, which is not generally the case. 
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with different eigenvalues are orthogonal, it follows that the scalar product 
Ck, j,m|V, | kj m > is zero ifm # m +1. 

Summing up, the selection rules obtained for the matrix elements of V are 
as follows: 


Y — Am=m-—-m'=0 (11-a) 
V, == Am =m-—-m = +1 (11-b) 
V=> dm=m-=0"w=-1 (11-c) 


From these results, we can easily deduce the forms of the matrices which represent 
the restrictions of the components of V inside a subspace &(k, /). The one associated 
with V, is diagonal, and those associated with V, have matrix elements only just 
above and just below the principal diagonal. 


b. PROPORTIONALITY BETWEEN THE MATRIX ELEMENTS 
OF J AND V INSIDE A SUBSPACE 4(k, j) 


a. Matrix elements of V, and V_ 


Expressing the fact that the matrix element of the commutator (8-a) between 
the bra < k, j, m + 2 | and the ket | k, j, m > is zero, we have: 


Ck jm+2| JIU |k,jm =< k,jm+2|V,J,|k,j,m> (12) 


On both sides of this relation and between the operators J, and V, , we insert the 
closure relation : 


Y [kim > Ck im | = I (13) 


kfm’ 


We thus obtain the matrix elements < k,j,m|J, |k',j’,m’ > of J, ; by the very 
construction of the standard basis { | k, j, m > }, they are different from zero only 
ifk = k',j =j and m =m' + 1. The summations over k’, j and m' are therefore 
unnecessary in this case, and (12) can be written: 


Chkyimt+2|J,|kKimt+ 1 Ck im + tl V, | kim) 
=Ckjmt+2|VW IK Amtl>< ck ¿mes 113, [Aim (14) 


that is: 


Chim +1 k |kim _ Chim +2/V, [kim +1 


<k jm +1jJa|kjimy  <k,jm+2|J,|k,j,m +1) (15) 


(as long as the bras and kets appearing in this relation exist, that is, as long as 
j—-2>m> — j, we can show immediately that neither of the denominators can 
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go to zero). Writing the relation thus obtained for m = — j, —j + l, ...,j — 2, 
we get: 
<k,j}-j+1 V4 St Net ALS eet ae kj, = +1) _ 
<k} =j +i|Jijkj -jy <kj=j+2[J,|k,j=j+1> 
_Ckjm+1|V,|k jm) _ 
—<k jm +1 |J,|k,jm> ” 
AAA IED: 
<k, jj, |kjj-= 1> 


that is, if we call a, (k, j) the common value of these ratios: 
Ckijm+1]1V, |k,jm> =0,(k,))<k,jm>+1|J, |k,j,m> (17) 


where a, (k, j) depends on k and on j, but not on m. 

In addition, selection rule (11-b) implies that all the matrix elements 
< k,jım | V, | k,jm' >and<k,j,m|J, | k,j,m' »arezeroif4m =m —m'¢ +1. 
Therefore, whatever m and m’, we have: 


(k)jm|V,|k, jm) =09,(k,)<k,j m|J,|k,j m > (18-a) 


This result expresses the fact that all the matrix elements of V, inside &(k, j) are 
proportional to those of J,. 

An analogous argument can be made by taking the matrix element of the 
commutator (8-d) between the bra < k, j, m — 2 | and the ket | k, j, m > to be zero. 
We are thus led to: 


< k,j,m|V_ [kim > = a_(kf\Ck,j,m|J_ |k. jm (18-b) 


an equation which expresses the fact that the matrix elements of V_ and J_ inside 
&(k, j) are proportional. 


B. Matrix elements of V, 


To relate the matrix elements of Y, to those of J,, we now place relation (8-c) 
between the bra < k, j, m | and the ket | k, j, m X: 


— 2h <¢k,j,m|V,|k,j,m> 
= <k, j,m| (J-V, - V,J_)|Kk,j,m> 
=hvV/j(j +1) — mm +1)<k jm +1|V,|k,jm> 
AVG +1) — mim— 1)<k,jm|V,|k,jm— r> (19) 
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Using (18-a), we get : 
<k,j,m| V_|k, j,m> 


= NA +1) —= mm +1) Ck jm +1|J,|k,j,m> 


~ Jj +1) mim — D<kjm|J,|kjm-1>} 
= EDAD mm +1) — j +1) + mm = 1)} (20) 


that is: 

<k,j m| V¿|k,j,m)> = mh a, (k, j) (21) 
Similarly, an analogous argument based on (8-b) and (18-b) leads to: 

< k,j,m | V,|k,j,m> = mh a_(k, j) (22) 


Relations (21) and (22) show that a, (k, j) and a_ (k, j) are necessarily equal; from 
now on, we shall call their common value a(k, j): 


alk, j) = x, (k, j) = a(k, J) (23) 
In addition, these relations imply that: 


< k,j,m k)j mo) = alk, j) <k, j, m| J, | k, j, m > (24) 


V, 


Y. Generalization to an arbitrary component of V 


Any component of V is a linear combination of V,, Y. and Y. Consequently, 
using relation (23), we can summarize (18-a), (18-b) and (24) by writing: 


Ck, jm|V[ kim > = alk, j)<k,jm|J |k, j, m > (25) 


Therefore, inside &(k, j), all the matrix elements of V are proportional to those of J. 
This result expresses the Wigner-Eckart theorem, for a special case. Introducing 
the “restrictions” of V and J to &(k, j) (cf. complement B,,, $3), we can also write it : 


Pik, j) V Plk, j) = alk, j) P(k, j) J P(k, j) (26) 


COMMENT: 


J commutes with P(k, j) Lef. (27)]; since, moreover 
[P(k, 5117 = P(k, j) 


we can omit either one of the two projectors P(k, j) on the right-hand side 
of (26). 
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c. CALCULATION OF THE PROPORTIONALITY CONSTANT: 
THE PROJECTION THEOREM 


Consider the operator J . V; its restriction to &(k, j) is P(k, j)J . VP(k, j). 
To transform this expression, we can use the fact that: 


[J, P(k, j)] = 0 (27) 


a relation that can easily be verified by showing that the action of the commutators 
[Jz P(k, j)] and [J,, P(k, /)] on any ket of the { |k, j, m > } basis yields zero. 
Using (26), we then get: 


P(k, 3. V P(k,j) 


J. [P(k. j) V P(k, j)] 
a(k, j) J P(k, j) (28) 
alk, j) jU + Ih? P(k, j) 


The restriction to the space &(k, j) of the operator J . V is therefore equal to the 
identity operator * multiplied by a(k, /)j(j + !)h?. Therefore, if Wy; > denotes an 
arbitrary normalized state belonging to the subspace &(k,/), the mean value 
<J.V >, of J. V is independent of the ket | y, ; > chosen, since: 


«J VOR =< Wij | J.V | Wij? = alk, IU + 1)h? (29) 


If we substitute this relation into (26), we see that, inside the subspace &(k, j)**: 


e = SUEY alg (30) 
<J? Pej ii + Whe 


This result is often called the “projection theorem”. Whatever the physical 
system being studied, as long as we are concerned only with states belonging to 
the same subspace &(k, j), we can assume that all vector operators are proportional 
to J. 

We can give the following classical physical interpretation of this property : 
if j denotes the total angular momentum of any isolated physical system, all the 
physical quantities attached to the system rotate about j, which is a constant vector 
(cf. fig. 1). In particular, for a vector quantity v, all that remains after averaging 
over time is its projection y, onto j, that is, a vector parallel to j, given by : 


vy, = Jey; (31) 


a formula which is indeed analogous to (30). 


* Since J. V is a scalar, the fact that its restriction is proportional to the identity operator was 
to be expected. 

** We shall say that an operator relation is valid only inside a given subspace when it is actually 
valid only for the restrictions of the operators being considered to this subspace. To be completely 
rigorous, we should therefore have to place both sides of relation (30) between two projectors P(k, j). 
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FIGURE 1 


Classical interpretation of the projection theorem : since the vector y 
rotates very rapidly about the total angular momentum j, only its 
static component y should be taken into account. 


COMMENTS: 


It cannot be deduced from (30) that, in the total state space [the direct sum 
of all the subspaces £(k, j)], V and J are proportional. It must be noted that 
the proportionality constant a(k, j) (or <J.V >, ¡) depends on the sub- 
space 6(k, j) chosen. Moreover, any vector operator V may possess non-zero 
matrix elements between kets belonging to different subspaces &(k, j), while 
the corresponding elements of J are always zero. 

Consider a second vector operator W. Its restriction inside &(k, j) is propor- 
tional to J, and therefore also to the restriction of V. Therefore, inside 
a subspace &(k, j), all vector operators are proportional. 


However, to calculate the proportionality coefficient between V and W, we cannot 
simply replace J by W in (30) (which would give the value < V.W >, ;/< W? >, ;). In 
the proof leading to relation (30), we used the fact that J commutes with P(k, /) in (28). 
which is not generally the case for W. To calculate this proportionality coefficient 
correctly, we note that, inside the subspace &(k, j): 


J.W >; 
vad Pei y (32) 
SE 
This yields, with (30) taken into account: 
Vo, 
2 INV Obs (33) 
<I. We 


Application: calculation of the Landé g, factor 
of an atomic level 


In this section, we shall apply the Wigner-Eckart theorem to the calculation 


of the effect of a magnetic field B on the energy levels of an atom. We shall see that 
this theorem considerably simplifies the calculations and enables us to predict, in 
a very general way, that the magnetic field removes degeneracies, causing 
equidistant levels to appear (to first order in B). The energy difference of these 
states is proportional to B and to a constant g, (the Landé factor) which we shall 
calculate. 


Let L be the total orbital angular momentum of the electrons of an atom 


(the sum of their individual orbital angular momenta L,), and let S be their total 
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spin angular momentum (the sum of their individual spins S,). The total internal 
angular momentum of the atom (assuming the spin of the nucleus to be zero) is: 


J=L+S (34) 


In the absence of a magnetic field, we call A, the Hamiltonian of the atom; 
Ho commutes with J*. We shall assume that Ho, L?, S?, J? and J, form a C.S.C.O., 
and we shall call | Eo, L, S, J, M > their common eigenvectors, of eigenvalues Eo, 
L(L + 1)h?, S(S + 1)h?, J(J + 1)h? and Mh, respectively. 


This hypothesis is valid for a certain number of light atoms for which the angular momentum 
coupling is of the L.S type (cf. complement B,,,). However, for other atoms, which have 
a different type of coupling (for example, the rare gases other than helium), this is not the case. 
Calculations based on the Wigner-Eckart theorem, similar to those presented here, can then 
be performed, and the central physical ideas remain the same. For the sake of simplicity, we shall 
confine ourselves here to the case in which L and S are actually good quantum numbers for the 
atomic state under study. 


a. ROTATIONAL DEGENERACY; MULTIPLETS 


Consider the ket J, | Eo, L, S, J, M >. According to the hypotheses set forth 
above, J, commutes with H,; therefore, J, | Eo, L, S, J, M > is an eigenvector 
of H, with the eigenvalue E,. Furthermore, in accordance with the general properties 
of angular momenta and their addition, we have: 


Je | Eo L, S, J, M > = AV J(J + 1) — M(M + 1)|E,,L,8,J,M +1) (35) 


This relation shows that, starting with a state | Eo» L, S, J, M >, we can 
construct others with the same energy : those for which — J < M < J. It follows 
that the eigenvalue E, is necessarily at least (2J + 1)-fold degenerate. This is an 
essential degeneracy, since it is related to the rotational invariance of Ho (an 
accidental degeneracy may also be present). In atomic physics, the corresponding 
(2J + 1)-fold degenerate energy level is called a multiplet. The eigensubspace 
associated with it, spanned by the kets | Ey, L, S, J, M > with M = J, J —1,..., — J, 
will be written £ (Ep, L, S, J). 


b. REMOVAL OF THE DEGENERACY BY A MAGNETIC FIELD; 
ENERGY DIAGRAM 


In the presence of a magnetic field B parallel to Oz, the Hamiltonian becomes 
(cf. complement D,,,;): 


H = Ho + H, (36) 


* This general property follows from the invariance of the energy of the atom under a rotation 
of all the electrons, performed about an axis passing through the origin (which is the position of the 
nucleus, assumed to be motionless). H,, which is invariant under rotation, therefore commutes with J 
(A, is a scalar operator; cf. complement By, $5-b). 
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with: 
A, = 0,(L, + 28,) (37) 


(the factor 2 before S, arises from the electron spin gyromagnetic ratio). The 
“Larmor angular frequency” w, of the electron is defined in terms of its mass m 
and its charge q by: 


MES a H (38) 


(where #g = qh/2m is the Bohr magneton). 

To calculate the effect of the magnetic field on the energy levels of the atom, 
we shall consider only the matrix elements of A, inside the subspace (Eo, L, S, J) 
associated with the multiplet under study. Perturbation theory, which will be 
explained in-chapter XI, justifies this procedure when B is not too large. 

Inside the subspace $(£,, L, S, J), we have, according to the projection 
theorem (§ 2-c): 


2 < L.J DELS, J (39-a) 


HI +17? 


S [S-J uss J (39-b) 


NJ +1}? 


where < L.J >e z.s, and <S. J >g zs, denote respectively the mean values of 
the operators L. J and S . J for the states of the system belonging to $ (Eo, L, S, J). 
Now, we can write: 


L.J =L.(L +$) = L? +5 (8 — L — S?) (40-a) 
as well as: 
S.J = S.(L + S) = S? +3 (8 — L? - $?) (40-b) 


It follows that: 
2 
CL. I des = UL + DA? +Ë pU +1) - UL +1) — S(S +1)] 
(41-a) 
and: 


(Srs = SS + 1h? + [J(J +1) - LL +1) — S(S +1)] 
(41-b) 


Relations (41), substituted into (39) and then into (37), show that, inside the 
subspace $(£,, L, S, J), the operator H, is given by: 
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where the Landé g, factor of the multiplet under consideration is equal to: 


3 S(S +1) — LIL +1) 
= 3 STF (43) 


Relation (42) implies that the eigenstates of H, inside the eigensub- 
space &(E,, L, S, J) are simply the basis vectors | Ey, L, S, J, M >», with the 
eigenvalues: 


E,(M) = g,¡Mho, (44) 


We see that the magnetic field completely removes the degeneracy of the multiplet. 
As is shown by the diagram in figure 2, a set of (2J + 1) equidistant levels appears, 
each one corresponding to one of the possible values of M. Such a diagram permits 
generalization of our earlier study of the polarization and frequency of optical 
lines emitted by a fictitious atom with a single spinless electron (the “normal” 


Zeeman effect; cf. complement D,,,), to the case of atoms with several electrons 
whose spins must be taken into account. 


M E 
5 
2 
3 
2 
/ 1 
Pit 2 


Z 


Energy diagram showing the removal of the (27 + |)-fold degeneracy of a multiplet (here J = 5/2) 
by a static magnetic field B. The distance between two adjacent levels is proportional to [B| and to 
the Landé g, factor. 


ula nle pl — 


FIGURE 2 


References and suggestions for further reading : 


Tensor operators : Schiff (1.18), $28; Messiah (1.17), chap. XIH, $VI; Edmonds 
(2.21), chap. 5; Rose (2.19), chap. 5; Meijer and Bauer (2.18), chap. 6. 
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Complement Ex 
ELECTRIC MULTIPOLE MOMENTS 


1. Definition of multipole moments 


a. Expansion of the potential on the spherical harmonics 
b. Physical interpretation of multipole operators 
c. Parity of multipole operators 
d. Another way to introduce multipole moments 
2. Matrix elements of electric multipole operators 
a. General expression for the matrix elements 
b. Selection rules 
c. Physical consequences 


Consider a system .4 composed of N charged particles placed in a given electrostatic 
potential U(r). We shall show in this complement how to calculate the interaction 
energy of the system ¥ with the potential U(r) by introducing the electric multipole 
moments of 4. First of all, we shall begin by recalling how these moments are 
introduced in classical physics. Then we shall construct the corresponding quantum 
mechanical operators, and we shall see how, in a large number of cases, their use 
considerably simplifies the study of the electrical properties of a quantum mechanical 
system. This is because these operators possess general properties which are inde- 
pendent of the system being studied, satisfying in particular certain selection rules. 
For example, if the state of the system .4 being studied has an angular momentum j 
[i.e. is an eigenvector of J? with the eigenvalue j(j + 1)h?], we shall see that the 
mean values of all multipole operators of order higher than 2; are necessarily zero. 


1. Definition of multipole moments 


a. EXPANSION OF THE POTENTIAL 
ON THE SPHERICAL HARMONICS 


For the sake of simplicity, we begin by studying a system composed of 
a single particle, of charge q and position r, placed in the potential U(r). We shall 
then generalize the results obtained to N-particle systems. 
a. Case of a single particle 

In classical physics, the potential energy of the particle is: 

V(r) = qUir) (1) 


Since the spherical harmonics form a basis for functions of 0 and q, we can 
expand U/(r) in the form: 


Ue) =Y Y ha YT0.0) e) 
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We shali assume the charges creating the electrostatic potential to be placed 
outside the region of space in which the particle being studied can be found. In 
this whole region, we then have: 


AU(r) = 0 (3) 


Now, we know [cf. relation (A-15) of chapter VII] that the Laplacian A is related 
to the differential operator L? acting on the angular variables 0 and ọ by: 


(4) 


Also, the very definition of the spherical harmonics implies that: 
L*Y7(0, p) = IU + 1)h?Y7(0, y) (5) 


It is therefore easy to calculate the Laplacian of expansion (2), If we write, using (3), 
that each of the terms thus obtained is zero, we get : 


ES ¿Cs hill) = 0 T (6) 


This equation has two linearly independent solutions, r’ and r” “*D, Since U(r) is 
not infinite for r = 0, we must choose: 


E (7) 


where the c,,, are coefficients which depend on the potential under consideration 


(the factor V 4r/(2l + 1) is introduced for convenience, as will be seen later). 
We can therefore write (2) in the form: 


20 l 
Vir) = aUe) = È Y Cim Br) (8) 
1=0 m=-1 
where the functions 27'(r) are defined by their expressions in spherical coordinates : 


A 


m dr m 
a) =a j a YF 0) 0) 


In quantum mechanics, the same type of expansion is possible; the potential 
energy operator of the particle is V(R) = gU(R), whose matrix elements in the 
{|r > } representation are (cf. complement By, §4-b): 


<r|qU(R)|r> = qUír) (r — r') (10) 
Expansion (8) then yields : 


V(R) = qU(R) = 


20 
l= 


l 
Y i Cim Qr (11) 


o 

3 
li 
! 
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where the operators O are defined by: 
Cr] Qr |r > = 270) ôe — r’) 
= 4 JI T” YT, o) öt 1) (12) 
The Q7 are called “electric multipole operators”. 


PB. Generalization to N particles 


Now consider N particles, with positions r}, r3, ..., ry and charges q,, q», -».> Yy- 
Their coupling energy with the external potential U(r) is: 


Vr), Pa)... Ty) = y qnU(r,,) (13) 


The argument of the preceding section can immediately be generalized to show 
that : 


20 l 
V(r Ea, sry) = Y, a Cim Sr Wis Fa, c Fy) (14) 


where the coefficients c,,, [which depend on the potential U(r)] have the same 
values as in the preceding section, and the functions 27 are defined by their values 
in polar coordinates : 


m 4r N l m 
DE], Fa, ey Ey) = MEL 2 Qn (Yn) YTO Pn) (15) 


(0, and y, are the polar angles of r,). The multipole moments of the total system 
are therefore simply the sums of the moments associated with each of the particles. 

Similarly, in quantum mechanics, the coupling energy of the N particles with 
the external potential is described by the operator : 


20 


V(R,, R,, ney Ry) = È ys Cm Qr (16) 


1=0 m=-1 


with: 


L Vie hie ce Fy | QF | ri, ri, ye 
= DE, 0, ry) ôl, — ri) Ó(r, — 15)... d(ty — ry) (17) 


b. PHYSICAL INTERPRETATION OF MULTIPOLE OPERATORS 


a. The operator Q}; the total charge of the system 


Since Y is a constant (Y$ = 1 WN 4), definition (15) implies that: 


N 
B= Y h (18) 
n=1 
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The operator Q? is therefore a constant which is equal to the total charge of the 
sysiem. 

The first term of expansion (14) therefore gives the coupling energy of the 
system with the potential U(r), assuming all the particles to be situated at the 
origin O. This is obviously a good approximation if U(r) does not vary 
very much in relative value over distances comparable to those separating the 
various particles from O (if the system 4 is centered at O, this distance is of the 
order of the dimensions of ). Furthermore, there exists a special case in which 
expansion (14) is rigorously given by its first term : the case where the potential U(r) 
is uniform, and therefore proportional to the spherical harmonic / = 0. 


p. The cperators Q%; the electric dipole moment 


According to (15) and the expression for the spherical harmonics Yọ 
[cf. complement A,,, equations (32)], we have: 


1 
A —)Y Qn (x, aie iya) 
2 n 


go — = 
< 2 =} qn Zn 
n 


= 
Í 


1 
| a," = hd (x, — iVn) (19) 


These three quantities can be considered to be the components of a vector on the 
complex basis of three vectors e,, e, and e_,: 


J= — 2'e, + Be — Bie_, (20) 
with : 
e = — EN (e, tie): e =e e, = = (e, — ie,) (21) 
1 x J> êo z= €_ =(e, — le 
Ja y 1 WE y 


(where e,, e,, e, are the unit vectors of the Ox, Oy and Oz axes). The components 
of this vector 2 on the Oxyz axes are then: 


1 


2 EVA =; | = 2 nn 


2, =e ee + 211 = Las 
2 n 
2 = 2 = Y qZ, (22) 


We recognize the three components of the total electric dipole moment of the 
system Y with respect to the origin O: 


I= Y wt, (23) 


1062 


www.elsolucionario.net 


ELECTRIC MULTIPOLE MOMENTS 


The operators Q” are therefore actually the components of the electric dipole 
D= y q, R,. 


Relations (19) enable us, moreover, to write the / = 1 terms of expansion (14) 
in the form: 


q ym 1 
2, Chm 2) = “p ES Cia) È dn Xn 


i 
era +e Dan Yy + Cio de z, (24) 


We shall now show that the combinations of the coefficients c, ,, which appear in 
this expression are none other than the components of the gradient of the 
potential U(r) at r = 0. If we take the gradient of expansion (8) of U(r), the / = 0 
term (which is constant) disappears; the /= 1 term can be put into a form 
analogous to (24) and gives: 


[VU(r)], <9 = gl — €, -1)€, gl + (1-1), Cioe, (25) 


As for the / > 1 terms of (8), they are polynomials in x, y, z of degree higher than | 
(cf. Sy and ò below) which make no contribution to the gradient at r = 0. 
The / = 1 term of expansion (14) can therefore be written, using (23) and (25): 


(x ists) (WU) o = - I.8 =0) 26) 


where: 
8 (r) = —VU(r) (27) 


is the electric field at the point r. Thus we recognize (26) as the well-known expression 
for the coupling energy between an electric dipole and the field £. 


COMMENTS: 


(i) In physics, we often deal with systems whose total charge is zero (atoms, 
for example). 29 is then equal to zero, and the first multipole operator entering 
into expansion (14) is the electric dipole moment. This expansion can often 
be limited to the / = | terms [hence expression (26) |, since the terms for 
which / > 2 are generally much smaller (this is the case, for example, if the 
electric field varies little over distances comparable to the distances of the 
particles from the origin; the / > 2 terms are, furthermore, rigorously zero in a 
special case : the case in which the electric field is uniform [ cf. $8y and 6 below)|]. 
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(ii) For a system F composed of two particles of opposite charge + q and — q 
(an electric dipole), the dipole moment 2 is: 


D = qr, — r,) (28) 


Its value, which is related to the position of the “relative particle” 
(cf. chap. VII, §B) associated with the system S, therefore does not depend 
on the choice of the origin O. Actually this is a more general property : it is 
simple to show that the electric dipole moment of any electrically neutral 
system S is independent of the origin O chosen. 


Y. The operators QF ; the electric quadrupole moment 


Using the explicit expression for the Y} [ cf. complement A,,,, relations (33)]), 
we could show without difficulty that : 


23? = =. (x, + iv.) 

_ V6 , 
a =F sen Qn EnlXp E Ya) (29) 
2 =>E a, (322 — rè) 


In this way, we obtain the five components of the electric quadrupole moment of 
the system F. While the total charge of is a scalar, and its dipole moment is a 
vector J, it can be shown that the quadrupole moment is a second-rank tensor. 
In addition, an argument similar to the one in $ B would enable us to write 
the / = 2 terms of expansion (14) in the form: 


+2 1 8U N o. 
o], La (30) 
2 ij 


Pot Ox! Ax! |. zp n= 
(with x‘, x/ = x, y or z). These terms describe the coupling between the electric 


quadrupole moment of the system FP and the gradient of the field Ê (r) at the 
pointr = 0. 


ô. Generalization: the electric l-pole moment 


We could generalize the preceding arguments and show from the general 
expression for the spherical harmonics [cf. complement Ay,, relations (26) or (30)] 
that: 

— the quantities 27 are polynomials (which are homogeneous in x, y and z) 
of degree /. 

— the contribution to expansion (14) of the / terms involves /th order 
derivatives of the potential U(r), evaluated at r = 0. 

Expression (14) for the potential can thus be seen to be a Taylor series 
expansion in the neighborhood of the origin. As the variation of the potential U(r) 
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in the region about Z becomes more complicated, higher order terms must be 
retained in the expansion. For example, if U(r) is constant, we have seen that 
the / = 0 term is the only one involved. If the field 8 (r) is uniform, the / = 1 terms 
must be added to the expansion. If it is the gradient of the field Ê which is uniform, 
we must have / < 2, and so forth. 


c. PARITY OF MULTIPOLE OPERATORS 


Finally, we shall consider the parity of the O. We know that the parity 
of Y is (— 1) [cf. chap. VI, relation (D-28)]. Therefore (cf. complement Fj, 
§2-a), the electric multipole operator Q7 has a definite parity, equal to (— 1), 
independent of m. This property will prove useful in what follows. 


d. ANOTHER WAY TO INTRODUCE MULTIPOLE MOMENTS 


We shall consider the same system of N charged particles as in $1-a. How- 
ever, instead of considering the interaction energy of this system with a given 
external potential U(r), we shall try to calculate the potential W(p) created by these 
charges at a distant point p (cf. fig. 1). For the sake of simplicity, we shall use classical 
mechanics to treat this problem. The potential W(p) is then: 


we = 5 An (31) 


= 4né, ¿4 |p — r 


Now, when |p| > |r,|, it can be shown that: 


E (E) ies M 


FIGURE 1 


The potential W (p) created at a distant point by a system Y 
composed of N charged particles (of positions r,, r2, ...) 
can be expressed in terms of the multipole moments of . 


where o, denotes the angle (p, r,), and P, is the /th-order Legendre polynomial. 
Using the spherical harmonic addition theorem (cf. complement A,,, §2-e-y), we 
can write : 


P (cos a.) = Y) (- 1)" YTO, Pn) ¥T(O, ®) (33) 
(where O and @ denote the polar angles of p). Substituting (32) and (33) into (31), 
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we finally obtain: 


1a [4m eel 
(— 1) 2," 


Wie) = - ¥7(0, ®) (34) 


Ane, > m=-1 y 21 +1 


where 27'(r,,r,, ..., ry) is defined by relation (15). 

Relation (34) shows that the specification of the 27 perfectly defines the 
potential created by the particle system in regions of space outside the system Y. 
This potential W(p) can thus be seen to be the sum of an infinite number of terms: 


(i) The / = 0 term gives the contribution of the total charge of the system. 
This term is isotropic (it does not depend on O and 9) and can be written: 


Wolp) S Gn (35) 


=n PG 


This is the I/p potential which would be created by the charges if they were all 
situated at O. It is zero if the system is globally neutral. 


(ii) The / = 1 term gives the contribution of the electric dipole moment 2 of 
the system. By performing transformations analogous to those in $b-PB, it can be 
shown that this contribution can be written: 

1 2. 
W,(p) = (36) 


Ame, p? 


This potential decreases like 1/p? when p increases. 

(iii) The / = 2, 3, ... terms give, in the same way, the contributions to the 
potential W(p) of the successive multipole moments of the system under study. 
When p increases, each of these contributions decreases like 1/p'**, and its angular 
dependence is described by an /th-order spherical harmonic. Moreover, we see 
from (34) and definition (15) that the potential due to the multipole moment 2, 
is at most of the order of magnitude of W,(p) x (d/p)', where d is the maximum 
distance of the various particles of the system S from the origin. Therefore, if we 
are concerned with the potential at a point p such that p > d (the potential at a 
distant point), the W,(p) terms decrease very rapidly when / increases, and we do 
not make a large error by retaining only the lowest values of / in (34). 


COMMENT: 


If we wanted to calculate the magnetic field created by a system of 
moving charges, we could introduce the magnetic multipole moments of the 
system in an analogous way : the magnetic dipole moment*, the magnetic 
quadrupole moment, etc... The parities of the magnetic moments are the 
opposite of those of the corresponding electric moments: the magnetic dipole 
moment is even, the magnetic quadrupole moment is odd, and so on. This 
property arises from the fact that the electric field is a polar vector while the 
magnetic field is an axial vector. 


* There is no magnetic multipole moment of order / = 0 (magnetic monopole). This result is 
related to the fact that the magnetic field, whose divergence is zero according to Maxwell's equations, 
has a conservative flux. 
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2. Matrix elements of electric multipole moments 


We shall again consider, for the sake of simplicity, a system composed of a 
single spinless particle. However, generalization to N-particle systems presents no 
theoretical difficulty. 

The state space 6, of the particle is spanned by an orthonormal basis, 
{| Xn.tm > } Of common eigenvectors of L? [eigenvalue /(/ + 1)h?] and L, 
(eigenvalue mh). We shall evaluate the matrix elements of a multipole operator OQ” 
in such a basis. 


a. GENERAL EXPRESSION FOR THE MATRIX ELEMENTS 


a. Expansion of the matrix elements 


From the general results of chapter VII, we know that the wave functions 
associated with the states | y, ım > are necessarily of the form: 


dn.tm(¥) e Ro Ar) YT (6, p) (37) 


The matrix element of the operator Q7 can therefore be written, using (12): 
< Any dim: | Qr | An dama > = 


-| y? ar | sin 0 aof de ZE n.m (r 0, o) 270, 0,0) Ln, 1, l 0, 0) 
0 0 16) 


Tr 


= 2 dr R* . y pl ino 
NET 7 i ro dr RE ar) Rat) | sin 0 d0 


oO 
x | do Yr'*(0, o) YT(0. p) YT2(0, p) (38) 
0 


Thus, in the matrix element under consideration, we have a radial integral and an 
angular integral. The latter, furthermore, can be simplified; using the complex 
conjugation relation for the spherical harmonics [cf. chap. VI, relation (D-29)] 
and relation (26) of complement C, (the Wigner-Eckart theorem for spherical 
harmonics), we can show that it can be written: 


(= ym | sin 0 a | de Y7™(0, p) Y”(0, p) Y™(0. p) = 
0 0 

_ [2 +12, +1) 

4n(2l, +1) 


€1,,1;0.0|6,,0> ¢ 1, bm, m] l,m, > (39) 


Finally, we obtain: 


< Žnidim | Or | nz dom > = 


l : 
== Fats Qu! Znats >< boo Fs mg, m| d, m > (40) 


Ya, + 
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Where the “reduced matrix element” < Xan | Qi || Xn, > of the /th-order electric 
multipole operator is defined by: 


Xn, lOl Xan > = 42h, + 1,150, 0] 1,.0) 


x | dr r'*? Rx, (r) Repl) (41) 
o 


Relation (40) expresses, in the particular case of electric multipole operators, 
a general theorem whose application in the case of vector operators has already 
been illustrated (cf. complement Dy): the Wigner-Eckart theorem. 


COMMENT: 


We have confined ourselves here to a system Z composed of a single 
spinless particle. Nevertheless, if we consider a system of N particles which 
may have spins, we can generalize the results we have obtained. To do so, 
we must introduce the total angular momentum J of the system (the sum 
of the orbital and spin angular momenta of the N particles), and denote 
by | Xn.;m > the eigenvectors common to J? and J,. We can then derive a rela- 
tion similar to (40), in which /, and /, are replaced by j, and j, (cf. comple- 
ment G,, exercise 8). However, the quantum numbers /,, 7,, m, and m, can 
then be either integral or half-integral, depending on the physical system being 
considered. 


B. The reduced matrix element 


The reduced matrix element < Zn, ,, || Q: || %,,.., > is independent of m, m, 
and m,. It involves the radial part R, (r) of the wave functions X, ¡ m(r, 9, @). Its 
value therefore depends on the { | x, ¡ , > } basis chosen, and general properties 
can hardly be attributed to it. However, it can be noted that the Clebsch-Gordan 
coefficient < /,,/;0,0 | 1,,0 > involved in (41)iszero if/, + /, + /is odd (cf: comple- 
ment B,, $3-c); this implies that the reduced matrix element has the same property. 


COMMENT: 


This property is related to the (— 1)! parity of the electric multipole 
operators Of. For the magnetic multipole operators, we have already 
pointed out that their parity is (— 1)'*?; therefore it is when /, + /, + / 
is even that their matrix elements are zero. 


y. The angular part of the matrix element 


In (40), the Clebsch-Gordan coefficient < /,, /; m,, m | l,, m; > arises solely 
from the angular integral appearing in the matrix element of Q7 [cf. (38)]. This 
coefficient depends only on the quantum numbers associated with the angular 
momenta of the states being considered and does not involve the radial depen- 
dence R,, ,(r) of the wave functions. This is why it appears in the matrix elements 
of multipole operators whenever one chooses a basis of eigenvectors common 
to L? and L, (or J* and J, for a system of N particles which may have spins; 
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cf. comment of §« above). Now, we know that such bases are frequently used in 
quantum mechanics, and, in particular, that the stationary states of a particle in a 
central potential W(r) can be chosen in this form. The radial functions R, ,(r) 
associated with the stationary states thus depend on the potential W(r) chosen; 
this is therefore also true for the reduced matrix element < Xp, 1, || Qi || Xn. >. On the 
other hand, this is not the case for the angular dependence of the wave functions, 
and the same Clebsch-Gordan coefficient appears for all W(r); this is why it plays 
a universal role. 


b. SELECTION RULES 


According to the properties of Clebsch-Gordan coefficients (cf. comple- 
ment By, §1), < 1,,/; m,, m | l. m, > can be different from zero only if we have 
both: 

m =m,+m (42) 
Therefore, relation (40) implies that if at least one of these conditions is not met, 
the matrix element < Xn, tm | QT | %o,.1,m, > is necessarily zero. We thus obtain 
selection rules which enable us, without calculations, to simplify considerably our 
search for the matrix which represents any multipole operator Q". 

Furthermore, we saw in $ 2-a-f that the reduced matrix element of a multipole 
operator obeys another selection rule: 


— for an electric multipole operator : 
l + /, + / = an even number (44-a) 


— for a magnetic multipole operator: 


i, + L + / = an odd number (44-b) 
c. PHYSICAL CONSEQUENCES 
a. The mean value of a multipole operator in a state of well-defined angular 
momentum 


Assume that the state | y > of the particle is one of the basis states 
| Xns.t;m, >- The mean value < Q7 > of the operator Q/' is then: 


COP > = Caste | QT | Lati ma > (83) 
Conditions (42) and (43) are written here : 

m=0 (46) 

0</< 21, (47) 


Thus we obtain the following important rules: 
— the mean values, in a state | Ly, 1, m, >» Of all the operators Q are zero 
ifm #0: 


<Q7>=9 if m0 (48) 
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— the mean values, in a state | Xn i,m, >s Of all operators of order | higher 
than 21, are zero: 


<Q7T> =90 if /> 24 (49) 


If we now assume that the state | y >, instead of being a state | Zn, 1, m, >> 
is any superposition of such states, all corresponding to the same value of /,, it is not 
difficult to show that rule (49) remains valid [but not rule (48), since, in general, 
matrix elements for which m, 4 m, then contribute to the mean value < Q7 >]. 
Relation (49) is therefore a very general one and can be applied whenever the 
system is in an eigenstate of L°. 

Furthermore, relations (44) imply that the mean value of an /th-order 
multipole operator can be different from zero only if : 


— for an electric multipole operator : 

/ = an even number (50-a) 
— for a magnetic multipole operator: 

/ = an odd number (50-b) 


The preceding rules enable us to obtain, conveniently and without calculations , 
some simple physical results. For example, in an / = 0 state (like the ground state 
of the hydrogen atom), the dipole moments (electric or magnetic), quadrupole 
moments (electric or magnetic), etc. are always zero. For an / = | state, only the 
Oth-, 1st- and 2nd-order multipole operators can be non-zero; parity rules (50) 
indicate that they are the total charge and electric quadrupole of the system, as 
well as its magnetic dipole. 


COMMENT: 


The predictions obtained can be generalized to more complex systems 
(many-electron atoms for éxample). If the angular momentum of such a system 
is j (integral or half-integral) one can show that it suffices to replace, in (49), 
i, by /. 

We shall apply, for example, rules (49) and (50) to the study of the 
electromagnetic properties of an atomic nucleus. We know that such a nucleus 
is a bound system composed of protons and neutrons, interacting through 
nuclear forces. If, in the ground state*, the eigenvalue of the square of the 
angular momentum is Y(/ + 1)h?, the quantum number / is called the nuclear 
spin. The rules we have stated indicate that: 


— if /=0, the electromagnetic interactions of the nucleus are 
characterized by its total charge, all the other multipole moments being zero. 
This is the case, for example, for *He nuclei (*«-particles”), 7°Ne nuclei, etc. 


* In atomic physics, one generally consider the nucleus to be in its ground state: the energies 
involved, although high enough to excite the electronic cloud of the atom, are much too small to excite 
the nucleus. 
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— if Z = 1/2, the nucleus has an electric charge and a magnetic dipole 
moment [parity rule (50-a) excludes an electric dipole moment]. This is 
the case for the >He nucleus and the H nucleus (i.e., the proton), as well as 
for all spin 1/2 particles (electrons, muons, neutrons, etc. ). 

— if 7 = 1, we must add the electric quadrupole moment to the charge 
and the magnetic dipole moment. This is the case for °H (deuterium), °Li, etc. 


This argument can be generalized to any value of /. Actually, very few 
nuclei have spins greater than 3 or 4. 


B. Matrix elements between states of different quantum numbers 


For arbitrary /,, /4,, m, and m,, the selection rules must be applied in their 
general forms, (42), (43) and (44). Consider, for example, a particle of charge g 
subjected to a central potential W (r), whose stationary states are the states | £,, , » >. 
Assume that we then add an additional electric field $, uniform and parallel to Oz. 
In the corresponding coupling Hamiltonian, the only non-zero term is the electric 
dipole term (cf: $1-b-P): 

V(R) = - D. 

=-Dé (51) 
As we saw in (22), the operator D. is equal to the operator O?. Selection rules (42) 
and (43) then indicate that : 

— the states | y, ¡ ,, » coupled by the additional Hamiltonian V(R) necessa- 
rily correspond to the same value of m. 

— the /-values of two states necessarily differ by + 1 [they cannot be equal, 
according to (44-a)]. We can predict without calculation that a large number of 
matrix elements of V(R) are zero. This considerably simplifies, for example, the 


study of the Stark effect (cf. complement E,,,), and that of the selection rules gover- 
ning the emission spectrum of atoms (cf. complement Axu). 


References and suggestions for further reading: 


Cagnac and Pebay-Peyroula (11.2), annexe IV; Valentin (16.1), chap. VIII; 
Jackson (7.5). chaps. 4 and 16. 
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Complement Fx 


EVOLUTION OF TWO ANGULAR MOMENTA J, AND J, 
COUPLED BY AN INTERACTION aJ, . J; 


1. Classical review 


a. Equations of motion 
b. Motion of $, and $, 


2. Quantum mechanical evolution of the mean values < J, > and < J, > 


, .d d 
a. Calculation fais Ji? and < J, > 
b. Discussion 


3. The special case of two spin 1/2’s 


a. Stationary states of the two-spin system 
b. Calculation of < S, X(t) 
c. Discussion. Polarization of magnetic dipole transitions 


4. Study of a simple model for the collision of two spin 1/2 particles 


a. Description of the model 
b. The state of the system after collision 
d. Discussion. Correlation introduced by the collision 


In a physical system, we must often consider the effect of a coupling between two 
partial angular momenta J, and J,. These can, for example, be the angular 
momenta of two electrons of an atom, or the orbital and spin angular momenta 
of an electron. In the presence of such a coupling, J, and J, are no longer constants 
of the motion; only : 


J=3,+3, (1) 


commutes with the total Hamiltonian of the system. 
We shall assume that the term of the Hamiltonian which introduces a coupling 
between J, and J, has the simple form: 


W=aJ,.J, (2) 


where a is a real constant. Such a situation is frequently encountered in atomic 
physics. We shall see numerous examples in chapter XII, when we use perturbation 
theory to study the effect on the hydrogen atom spectrum of interactions involving 
electron or proton spins. When the coupling has the form (2), classical theory 
predicts that the classical angular momenta $, and §, will precess about their 
resultant 4 with an angular velocity proportional to the constant a (cf. $1 below). 
The “vector model” of the atom, which played a very important role in the history of 
the development of atomic physics, is founded on this result. In this complement, 
we shall show how, with the knowledge of the common eigenstates of J? and J,, 
one can study the motion of the mean values < J, > and < J, >, and again derive, 
at least partially, the results of the vector model of the atom ($$ 2 and 3). In addition, 
this study will enable us to specify in simple cases the polarization of the electro- 
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magnetic waves emitted or absorbed in magnetic dipole transitions. Finally, (§ 4), 
we shall take up the case in which the two angular momenta J, and J, are coupled 


only during a collision but not permanently. This case will serve as a simple illustra- 
tion of the important concept of correlation between two systems. 


1. Classical review 


a. EQUATIONS OF MOTION 


If 6 is the angle between the classical angular momenta $, and $, (fig. 1), 
the coupling energy can be written: 


W =a$,.$2.=4f, 91,0080 (3) 


Let M be the energy of the total system in the absence of coupling [24 can represent, 
for example, the sum of the rotational kinetic energies of systems (1 ) and (2)]. We 
shall assume: 


W<H, (4) 


A 
A 


FIGURE l 


Two classical angular momenta f, and $, coupled by an inter- 
action W = af, . $, = af A cos 9. 


Let us calculate the moment # , of the forces acting on system (1). Let u 
be a unit vector and dW, the variation of the coupling energy when the system (1) 
is rotated through an angle dx about u. We know (the theorem of virtual work) 
that: 


M u= -— (5) 


Starting with (3) and (5), we then obtain, via a simple calculation : 


M, = — af, x J, (6-a) 

M, = — af, x Jı (6-b) 
and, consequently : 

H = — ug, x g, (T-a) 

Bags (7-b) 
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b. MOTION OF $, AND f, 
Adding (7-a) and (7-b), we obtain: 


SIA +9) =0 (8) 


which shows that the total angular momentum $, + § , is indeed a constant of the 
motion. Furthermore, it can easily be deduced from (7-a) and (7-b) that: 


ZORSE ° 
and: 
sas) + (GH) #2 = GHG) = (10) 


The angle O between f, and Y,, as well as the moduli of J, and JY,, therefore 
remain constant over time. Finally : 


d 
ae es A$ Fi) x $i = ag x fi (11) 


Since 4 = $, + $, is constant, the preceding equation shows that $ , precesses 
about 4 with an angular velocity equal to a| $| (fig. 2). 


FIGURE 2 


Under the effect of the coupling Y = a J, ¿ Í, the angular 
momenta ¥, and Ž, precess about their resultant ¥, which is a 
constant of the motion. 


Under the effect of the coupling, ¥ , and # , therefore precess about their 
resultant 4 with an angular velocity proportional to | | and to the coupling 
constant a. 


2. Quantum mechanical evolution of the mean values < J, > and < J, > 


d d 
a. CALCULATION OF act? AND qt? 


Recall, first of all, that if 4 is an observable of a quantum mechanical system 
of Hamiltonian H, we have (cf. chap. MI, $D-1-d): 


GAO == < 14.100 (12) 
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In the present case, the Hamiltonian H is equal to: 
H=H,+W (13) 


where H, is the sum of the energies of systems (1) and (2), and W is the coupling 
between J, and J, given in (2). In the absence of such a coupling, J, and J, are 
constants of the motion (they commute with Ho). Therefore, in the presence of the 
coupling, we have simply : 


d 


Echn) =a [So WID = [Se 5 LD (14) 


and an analogous expression for =< J, >. The calculation of the commutator 

appearing in formula (14) does not present any difficulty. We have, for example : 
[yy J; . J,] = [ix J ty ay] t Lix 412722] 

BJ Say — AJia 

= — (3, x Ja) (15) 


From this, we see finally that: 
EI) = -alh x> (16-a) 


FI) = -alh xJ) (16-b) 


b. DISCUSSION 


Note the close analogy between formulas (7-a) and (7-b) on the one hand and 
formulas (16-a) and (16-b) on the other. Adding (16-a) and (16-b), we again find 
that J is a constant of the motion, since: 


d d d 
A E Sl (17) 
However, we must recall that, in general : 


<A) AXIS) XC Ind (18) 


The motion of the mean values is therefore not necessarily identical to the classical 
motion. To examine this point in greater detail, we shall now study a special case : 
that in which J, and J, are two spin 1/2’s, which we shall denote by S, and S,. 


3. The special case of two spin 1/2's 


The evolution of a quantum mechanical system can easily be calculated in 
the basis of eigenstates of the Hamiltonian of this system. Therefore, we shall begin 
by determining the stationary states of the two-spin system. 
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a. STATIONARY STATES OF THE TWO-SPIN SYSTEM 
Let 


S=S +5, (19) 
be the total spin. Squaring both sides of (19), we obtain : 
S? = Si + 83 + 28, .8S, (20) 


which enables us to write W in the form: 


W=a Sı- S: = 518° - s} - si] =5|S -3% | (21) 


(all vectors of the state space are eigenvectors of S? and Sí with the eigenvalue 3h7/4). 

In the absence of coupling, the Hamiltonian H, of the system is diagonal in 
the { | e,, €, > } basis (with e, = +, £, = +) of eigenstates of S,. and S,., as well 
as in the [| S, M > } basis (with S = Oor i, — S < M < + S) of eigenstates of S? 
and S,. The various vectors | «,, £, > or | S, M > are eigenvectors of Ho with the 
same eigenvalue, which we shall take to be the energy origin. 

When we take the coupling W into account, we see from formula (21) that the 
total Hamiltonian H = H} + W is no longer diagonal in the { | ¢,, s3 > } basis. 
However, we may write: 


ah? 
2 


(H, + W)|S.M) = [ss +1) =35|15.10) (22) 


The stationary states of the two-spin system therefore separate into two levels 
(fig. 3): the S = 1 level, three-fold degenerate with energy E, = ah*/4, and the 


Ho H, + W 


3ah? \ y ey 
7 = 
FIGURE 3 
Energy levels of a system of two spin 1/2”s. On the left-hand side of the figure, the coupling is assumed 
to be zero, and we obtain a single level which is four-fold degenerate. The coupling W = aS, .S, 
splits it into two distinct levels, separated by an energy of uh? : the triplet level (S = 1, three-fold 
degenerate) and the singlet level (S = 0, non-degenerate). 
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non-degenerate S = 0 level, with energy E, = — 3ah?/4. The splitting between the 
two levels is equal to ah?. If we set: 


ah? =n0 (23) 


02/21 is the only non-zero Bohr frequency of the two-spin system. 


b. CALCULATION OF < S, >(£) 


In order to find the evolution of < S, >(¢), we must first calculate the matrices 
representing Six. S,, and S,, (or, more simply, S,, and S,, = Six +S,,) in 
the { | S, M X } basis of stationary states. If we use expressions (B-22) and (B-23) 
of chapter X, which give the expansion of the states | S, M > on the [|e,,8, >) 
basis, it is possible to easily calculate the action of S,, or S,, on the kets | S, M >. 
Thus we find : 


. h 

ENTE = 511) 

So = Elo) 
A (24) 
h 

Sid 13 eal? 
h 

510,0) = 311.0) 

and: 
US, [LI =0 


h 
M is 
V2 


4 A A (25) 


TE a O 
af 2 
h 
V2 


From this, we can immediatelysderive the matrices representing S,. and S,. in the 


basis of the four states | S, M > arranged in the order | 1, 1 >, | 1,05, |1, — 1> 
and | 0,0 >: 
1 0 0 0 0 I 0 —1 
h jO 0 0 l h 0 0 1 0 
(S,-) => (S,4) = — 
210. Gah 6 Bio o 0 0 
o 1 0 0) (26) 0 0 1 0) 27 
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COMMENT: 
It can easily be shown that the restrictions of the S,, and S,, matrices to the 
= | subspace are proportional respectively (with the same proportionality coefficient) 
to the matrices representing S, and S, in the same subspace. This result could have been 
expected, in view of the Wigner-Eckart theorem relative to vector operators (cf. com- 
plement D,). 


Let 
|w(0)> =x|0,0> + B-,|1,- 1> + Bo|1,0> + 8, | 1,1) (28) 


be the state of the system at the instant 1 = 0. From this we deduce the expression 
for | W(t) > (to within the factor ediaht/ 4). 


| W(t) > = 4 |0,0>+ [B-,|1,-1> + Bo]1,0> + 8,|1,1>]07*” (29) 
It is then easy to obtain, using (26) and (27): 
< Sis O =< W(t) |S. Iyl) > 


=> CIB RIP +e apg +e ot By] (30) 
< Si+ >(£) =< w(t) | Si+ | wit)> 
= ZL PtP + BIB ~ e% pra + 0 af (31 
< Six X(t) and < S,, X(t) can be expressed in terms of < S,, »(1): 
< Six X(t) = Re < Sy, >(£) (32) 
È Siy 0) = Im Ç Si+ >(6) (33) 


Analogous calculations enable us to obtain the three components of < S, >(£). 


c. DISCUSSION. 
POLARIZATION OF MAGNETIC DIPOLE TRANSITIONS 


Studying the motion of < S, >(t) does more than compare the vector model 
of the atom with the predictions of quantum mechanics. It also enables us to specify 
the polarization of the electromagnetic waves emitted due to the motion of < S, >(t). 

The Bohr frequency 2/27 appears in the evolution of < S, >(+) because of the 
existence of non-zero matrix elements of S,,, S,,, or S,, between the state | 0, 0 > 
and one of the states | 1, M > (with M = — 1,0, + 1). In (28) or (29), we shall 
begin by assuming that, with a non-zero, only one of the three coefficients f_ ,, Bo 
or f, is different from zero. The examination of the motion of < S, >(z) in the three 
corresponding cases thus will enable us to specify the polarization of the radiation 
associated with the three magnetic dipole transitions : 


[0,0> <> |1,0>, |0,0> <=>|1,1> and |0,0> <> |1,-1> 
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We can always choose a to be real; we shall set: 


Bu = |Bule®™ (M =- 1,0, 1) (34) 


COMMENT: 


Actually, the electromagnetic waves are radiated by the magnetic moments M, 
and M, associated with S, and S, (hence the name, magnetic dipole transitions). M, 
and M, are proportional! respectively to S, and S,. To be completely rigorous, we should 
then study the evolution of <M, + M, >(t). Here we shall assume < M, > > <M, >. 
Such a situation is found, for example, in the ground state of the hydrogen atom : the 
hyperfine structure of this state is due to the coupling between the spin of the electron 
and that of the proton (cf. chap. XH, $ D). But the magnetic moment of the electron spin 
is much larger than that of the proton, so that the emission and absorption of electro- 
magnetic waves at the hyperfine transition frequency are essentially governed by the 
motion of the electron spin. Taking < M, > into account as well would complicate the 
calculations without modifying the conclusions. 


a. The|0,0> <=> |1,0> transition (B, = B_, =0) 
If we take B, = B_, = 0 in (30), (31), (32) and (33), we get: 


(Siz (0) = < Sap (1) = 0 


<S 1, >(t) = ha [Bo] cos (Qt — po) (35) 
Furthermore, it can easily be seen that: 
(S, >(t) = < S, >) = < S, >) = 0 (36) 


<S, >(£) and < S, X(t) are then permanently of opposite direction and vibrate 
along Oz at the frequency Q/2x (fig. 4). 


<S,> (t) SIGURE 4 


If the state of the two-spin system is a superposition of only the two stationary 
states |0,0 > and |1, 0 >, < S, > and < S, > are always of opposite direction and 
vibrate along Oz at the frequency Q/2x. 


CS, > (t) 


The electromagnetic waves emitted by < S, > therefore have a magnetic field * 
linearly polarized along Oz (“rz polarization”). 


* Since these are magnetic dipole transitions, we are concerned with the magnetic field vector 
of the radiated wave. In the case of an electric dipole transition (cf. complement Dyn $ 2-c), on the other 
hand, we would be concerned with the radiated electric field. 
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We see in this example that (< S] >)? varies over time and is therefore not 
equal to < S? > (which is constant and equal to 37/4). This represents an important 
difference with the classical situation studied in $ 1, in which 4, maintains a constant 
length over time. 

P. The|0,0><=> |1,1> transition (By = B_, =0) 

We find in this case: 


h 
oe >(£) = 2 IB)? 


CS) = = Toa fal cos (Ot — o) 


h 
S,,>() = -—= 
< Siy D(t) we 
Furthermore, it can easily be verified that: 
< S; >(£) =ħ [8,1? (38) 
CS, (0) = < S, X(t) = 0 


From this, it can be seen (fig. 5) that <S, >(+) and < S, >(£) precess counter- 
clockwise at an angular velocity Q about their resultant < S >, which is parallel to Oz. 
The electromagnetic waves emitted by <S, >(1) in this case therefore have a 
right-hand circular polarization (*o, polarization”). 


a |B,| sin (2r — p,) (37) 


FIGURE 5 


If the state of the two-spin system is a superposition of only the 
stationary states | 0, 0 > and | 1, 1 >, < S, > and < S, > precess 
counterclockwise about their resultant < S >, with the angular 
velocity Q. 


Note that here the motion obtained for the mean values < S, > and < S, > 
is the classical motion. 


Y. The |0,0> <» | 1, — 1 > transition (Bo = B, = 9) 


The calculations are closely analogous to those of the preceding section 
and lead to the following result (fig. 6): < S, >(£) and < S, >(1) precess about Oz, 
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again at the angular velocity Q, but in the clockwise direction. It must be noted 
that < S, > = — h]8 _ ,|* is now negative, so that while the direction of the precession 
of < S; > and < S, > about Oz is different from what it was in the preceding case, 
it remains the same relative to < S >. The electromagnetic waves emitted by < S, > 
are now left-hand circularly polarized (“o _ polarization”). 


FIGURE 6 


If the state of the two-spin system is a superposition of only the 
stationary states | 0, 0 > and | 1, — 1 >. < S, > and < S, > still 
precess in the counterclockwise direction with the angular velo- 
city Q about their resultant < S >; however, the latter is now 
directed opposite to Oz. 


ò. General case 


In the general case (any a, B_,, Bo and f, ), we see from (30), (31), (32) and (33) 
that the components of < S, »(t) on the three axes contain a static part and a part 
modulated at the frequency 92/27. Since these three projected motions are sinusoidal 
motions of the same frequency, the tip of < S, >(z) describes an ellipse in space. 
Since the sum 


CS, M0) + <S >) = <5S> 


remains constant. the tip of < S, >(1) also describes an ellipse (fig. 7). 


FIGURE 7 


Motion of < S, >(¢) and < S, >(¢)in the general case, in which the 
state of the two-spin system is a superposition of the four statio- 
nary states [0,0 », [1,1 >, |1,0> and | 1, — 1 >. The resul- 
tant < S > is still constant but is not necessarily directed along OZ. 
< S, > and < S, > no longer have constant lengths, and their tips 
describe ellipses. 
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Thus we find for the general case only part of the results of the vector model 
of the atom. We do find that, the larger the coupling constant a, the more rapidly 
< S, >(t)and < S, >(£) precess about < S >. However, as we saw clearly in the special 
case a studied above, |< S, »(1)| is not constant, and the tip of < S, »(£) does not 
describe a circle in the general case. 


4. Study of a simple model for the collision 
of two spin 1/2 particles 


a. DESCRIPTION OF THE MODEL 


Consider two spin 1/2 particles, whose external degrees of freedom we shall 
treat classically and whose spin degrees of freedom we shall treat quantum mechanic- 
ally. We shall assume that their trajectories are rectilinear (fig. 8) and that the inter- 
action between the two spins S, and S, is of the form W = aS, .S,, where the 
coupling constant a is a rapidly decreasing function of the distance r separating the 
two particles. 


Interaction region 


ee, 


oe SN 


a“ 


FIGURE 8 


nected Collision between two spin 1/2 particles (1) 
je on (2) and (2) whose orbital variables can be 


treated classically. The spin state of each 
particle is represented by a large arrow. 


Since r varies over time, so does a. The shape of the variation of a with respect 
to £ is shown in figure 9. The maximum corresponds to the time when the distance 
between the two particles is at a minimum. To simplify the reasoning, we shall 
replace the curve in figure 9 by the one in figure 10. 


A alt) 


FIGURE 9 


Shape of the variation of the cou- 
pling constant a(t) during the 
collision. 


t 
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a(t) 


a 
FIGURE 10 
Simplified curve used to represent 
schematically the variation of the 
coupling constant a(t) during the 
0 T t collision. 


The problem we have here is the following : before the collision, that is, at 
t = — œ, the spin state of the two-particle system is : 


|¥(— 0) =|+, -=> (39) 


What is the state | y(+ 00) > of the system after the collision? 


b. THE STATE OF THE SYSTEM AFTER COLLISION 
Since the Hamiltonian is zero for t < 0, we have: 


| ¥(0) > = | W(— ©) > =| +,-> 
1 
IA (40) 
The results of the preceding section concerning the eigenstates and eigenvalues 
of W = aS, .S, are applicable between times 0 and T and enable us to calculate 


| V(T)>: 
| y(T)> = os [|1,0>e HT + 10,0 > e ery (41) 


V2 


Multiplying (41) by the global phase factor e*Eo*£17/2% (of no physical importance), 
setting E, — E, = hQ [cf. formula (23)], and returning to the { | ¢,, 8, > } basis, 
we find : 


|W) > = cos EE] +, => sin], +) (42) 
Finally, since the Hamiltonian is zero for t > T, we have: 

y(+00)> =| ¥(T)> (43) 

COMMENT: 


The calculation could be performed for an arbitrary fun :tion a(t) of 
the type shown in figure 9. It would then be found necessary t > replace, in 


the preceding formula, aT = a by | a(t) dt (cf. exercise : of comple- 


00 


ment Ex) 
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c. DISCUSSION. 
CORRELATION INTRODUCED BY THE COLLISION 


If the condition: 


a = 5 +kr , kan integer 2 0 (44) 


is satisfied, we see from (42) that : 
y(+00)> =|-=,+> (45) 


The orientation of the two spins, in this case, is exchanged during the collision. 
On the other hand, if: 


er _ 


5 kn , kan integer 2 0 (46) 
we find that: 
|¥(+ 00)>=| +,->=|w(- 00)> (47) 


In this case, the collision has no effect on the orientation of the spins. 
For other values of 7, we have: 


u(+ 0)> =a]+,->+8B|-,+> (48) 


with a and $ simultaneously non-zero. The state of the two-spin system has been 
transformed by the collision into a linear superposition of the two states | +, — > 
and| —, +>.| (+ 00) > is therefore no longer a tensor product, although |y(— 0)> 
was one : the interaction of the two spins has introduced correlations between them. 

To see this, we shall analyze an experiment in which, after the collision, an 
observer [observer (1)] measures S,,. According to formula (48) for | y(+ 00) >, he 
has the probability |a|? of finding + ħ/2 and |p|? of finding — f/2 [according to (42), 
la]? + |p|? = 1]. Assume that he finds — h/2. Immediately after this measurement, 
the state of the total system is, according to the wave packet reduction postulate, 
| —, + >. If, at this moment, a second observer [observer (2)] measures S,,, he will 
always find + h/2. Similarly, it can easily be shown that if observer (1) finds the 
result + h/2, observer (2) will then always find — h/2. Thus, the result obtained 
by observer (1) critically influences the result that observer (2) will obtain later, 
even if at the time of these two measurements, the particles are extremely far apart. 
This apparently paradoxical result (the Einstein-Podolsky-Rosen paradox) reflects 
the existence of a strong correlation between the two spins, which has appeared 
because of their interaction during the collision. 


Note, finally, that if we are concerned with only one of the two spins, it is impossible 
to describe its state after the collision by a state vector, since, according to formula (48), 
| ¥(+ 00) > is not a tensor product. Spin (1), for example, can be described in this case only 
by a density operator (cf. complement E,,,). Let 


p = |y(+ 00)><y(+ w)| (49) 


be the density operator of the total two-spin system. According to the results of complement Eir 
(§ 5-b), the density operator of spin (1) can be obtained by taking the partial trace of p with 
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respect to the spin variables of particle (2): 


p(1) = Tr, p (50) 
Similarly : 
pQ)=Tr, p (51) 


It is easy to calculate, from expression (48) for | Y(+ 00) », the matrix representing p in the 
four-state basis, {| +, + >, | +, — >), | —. + >. | —.— > }, arranged in this order. We find: 


0 0 0.0 
O la? ap* 0 
p= (52) 
0 pa* pP 0 
0 0 0 0 
Applying (50) and (51), we then find : 
al? 0 
p(l) = ( | ) (53) 
o (Al 
BP o 
p(2) = ( 54 
o al? (54) 
Starting with expressions (53) and (54), we can form: 
P = pil) @ p(2) (55) 


whose matrix representation can be written: 


jaj? Ig? 0 œ 0 
0 alt 0 0 

p= Ht (56) 
0 o IB 0 


0 0 0 WPP 


We see that p’ is different from p, reflecting the existence of correlations between the two spins. 


References and suggestions for further readings 
The vector model of the atom: Eisberg and Resnick (1.3), chap. 8, $5; Cagnac 


and Pebay-Peyroula (11.2), chaps. XVI, §3B and XVII, §§3E and 4C. 
The Einstein-Podoisky-Rosen paradox : see references of complement Dn- 
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Complement Gx 
EXERCISES 


1. Consider a deuterium atom (composed of a nucleus of spin 7 = | and an 
electron). The electronic angular momentum is J = L + S, where L is the orbital 
angular momentum of the electron and S is its spin. The total angular momentum 
of the atom is F = J +1, where I is the nuclear spin. The eigenvalues of J? and F? 
are J(J + 1)h? and F(F + 1)h? respectively. 

a. What are the possible values of the quantum numbers J and F for 
a deuterium atom in the ls ground state? 

b. Same question for deuterium in the 2p excited state. 


2. The hydrogen atom nucleus is a proton of spin / = 1/2. 

a. In the notation of the preceding exercise, what are the possible values of 
the quantum numbers J and F for a hydrogen atom in the 2p level? 

b. Let {|n,1,m) } be the stationary states of the Hamiltonian H, of the 
hydrogen atom studied in §C of chapter VII. 

Let { |x, L s, J, M, > } be the basis obtained by adding L and S to form J 


(Mh is the eigenvalue of J,); and let { | n, L s, J, I, F, Mp > } be the basis obtained 
by adding J and I to form F (Mñh is the eigenvalue of F,). 


The magnetic moment operator of the electron is : 


In each of the subspaces &(n = 2, I = 1, s = 1/2, J, I = 1/2, F) arising from the 
2p level and subtended by the 2F + 1 vectors 


l 


1 
=2l=1 y = — = — 
|n ; „5S „J, I > 


2 , F, Mp > 


corresponding to fixed values of J and F, the projection theorem (cf. complement Dy, 
§§ 2-c and 3) enables us to write: 


M = Op gE /h 


Calculate the various possible values of the Landé factors g,, corresponding to 
the 2p level. 


3. Consider a system composed of two spin 1/2 particles whose orbital variables 
are ignored. The Hamiltonian of the system is: 


H =0,8,, + @,S), 


where S,, and S,, are the projections of the spins S, and S, of the two particles 
onto Oz, and œw, and œw, are real constants. 
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a. The initial state of the system, at time £ = 0, is: 
1 


y 


(with the notation of $ B of chapter X). At time 1, S? = (S, + S,)? is measured. 
What results can be found, and with what probabilities? 


|¥0)> = Ll + =>+|- +] 


Nl 


b. If the initial state of the system is arbitrary, what Bohr frequencies can 
appear in the evolution of < S? >? Same question for S, = S,, + Sax 


4. Consider a particle (a) of spin 3/2 which can disintegrate into two particles, 
(b) of spin 1/2 and (c) of spin 0. We place ourselves in the rest frame of (a). Total 
angular momentum is conserved during the disintegration. 


a. What values can be taken on by the relative orbital angular momentum 
of the two final particles? Show that there is only one possible value if the parity 
of the relative orbital state is fixed. Would this result remain valid if the spin of 
particle (a) were greater than 3/2? 

b. Assume that particle (a) is initially in the spin state characterized by the 
eigenvalue m,h of its spin component along Oz. We know that the final orbital 
state has a definite parity. Is it possible to determine this parity by measuring the 
probabilities of finding particle (b) either in the state | + > or in the state | — > 
(you may use the general formulas of complement Ax, §2)? 


5. Let S=S, +S, + S, be the total angular momentum of three spin 1/2 
particles (whose orbital variables will be ignored). Let | ¢,, e,, e, > be the eigenstates 
common to S,,, S22 53,, of respective eigenvalues e, h/2, e, h/2, a3 ħ/2. Give a basis 
of eigenvectors common to S? and S,, in terms of the kets | e,, £2, ez >. Do these 
two operators form a C.S.C.O.? (Begin by adding two of the spins, then add the 
partial angular momentum so obtained to the third one.) 


6. Let S, and S, be the intrinsic angular momenta of two spin 1/2 particles, 
R, and R,, their position observables, and m, and m,, their masses | with 


mı 


m : y 
i= oe the reduced mass} Assume that the interaction W between the two 
1 2 


particles is of the form: 


S, .S, 


W = UR) + VIR) 


where U(R) and V(R) depend only on the distance R = |R; — R,| between the 
particles. 
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a. Let S = S, + S, be the total spin of the two particles. 


a. Show that: 

P = 3 $ S; E 
4 ht 

1 S, .S, 

Pa 1 2 
° 4 h? 


are the projectors onto the total spin states S = l and S = 0 respectively. 


p. Show from this that W = W,(R)P, + W¿(R)P,, where W,(R)and W (R) 
are two functions of R, to be expressed in terms of U(R) and V(R). 

b. Write the Hamiltonian H of the “relative particle” in the center of mass 
frame; P denotes the momentum of this relative particle. Show that H commutes 
with S? and does not depend on S.. Show from this that it is possible to study 
separately the eigenstates of H corresponding to S = | and S = 0. 

Show that one can find eigenstates of H, with eigenvalue E. of the form: 


+1 
| Yr) = Ago |e >|S =0,M =0) t y milo» |S =1,M> 
M=-1 


where Ago and 4, y are constants, and | o$ > and | p} > are kets of the state space 6, 
of the relative particle (Mh is the eigenvalue of S_). Write the eigenvalue equations 
satisfied by | pp > and | pz > 

c. We want to study collisions between the two particles under consideration. 
Let E = h?k?/2u be the energy of the system in the center of mass frame. We assume 
in all that follows that, before the collision, one of the particles is in the | + > spin 
state, and the other one, in the | — > spin state. Let ly | > be the corresponding 
Stationary scattering state (cf. chap. VHI, $B). Show that: 


i 
lop >|S=0,M =0> +—=lp{>|S=1,M =0) 


Wid = 
dá J2 Re 


where | of > and | pi > are the stationary scattering states for a spinless particle 
of mass u, scattered respectively by the potentials W,(R) and W (R). 

d. Let fo(0) and f, (0) be the scattering amplitudes which correspond to | py > 
and | p; >. Calculate, in terms of fo(0) and /,(0). the scattering cross section 0,(0) 
of the two particles in the 0 direction, with simultaneous flip of the two spins (the 
spin which was in the | + > state goes into the | — > state, and vice versa). 

e. Let ò? and 6} be the phase shifts of the / partial waves associated respectively 
with W (R) and W (R) (cf. chap. VHI, $C-3). Show that the total scattering cross 
section o,, with simultaneous flip of the two spins. is equal to : 


A (21 + 1) sin? (6! — ò?) 
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7. We define the standard components of a vector operator V as the three 
operators: 


yi) = E JE (VY. + iV,) 
V2 
Vii Y 
y“! i = ao (Y, ps iV,) 
l> í ; 


Using the standard components Vi ™ and W4'? of the two vector operators V and W, 
we construct the operators: 


[vt @ Wry =p a| K, M> VWP 


p a 


where the < I, |; p, q |K. M > are the Clebsch-Gordan coefficients entering into 
the addition of two angular momenta | (these coefficients can be obtained from 
the results of $1 of complement Ax). 

a. Show that [V") @ W'"]0” is proportional to the scalar product V . W 
of the two vector operators. 

b. Show that the three operators [V @ W'"]/¡' are proportional to the 
three standard components of the vector operator V x W. 

c. Express the five components [V @ WI in terms of K, Y, = V, + iV, 
W.W ME iW, 

d. We choose V = W = R, where R is the position observable of a particle. 
Show that the five operators [R'} @ R")]{7) are proportional to the five compo- 
nents QO” of the electric quadrupole moment operator of this particle [ ef. formula (29) 
of complement Ex]. 

e. We choose V = W = L, where L is the orbital angular momentum of the 
particle. Express the five operators [LH Y L“']¿' in terms of L., L,, L_. What are 
the selection rules satisfied by these five operators in a standard basis { |k, 4m > } 
of eigenstates common to L? and L, (in other words, on what conditions 1s the 
matrix element 


Ck hm | [LO LY Lk im > 


non-zero)? 


8. Irreducible tensor operators ; Wigner-Eckart theorem 


The 2K + | operators 7g*', with K an integer > 0 and 
Ose hoe AA 


are, by definition, the 2K + | components of an irreducible tensor operator of 
rank K if they satisfy the following commutation relations with the total angular 
momentum J of the physical system: 
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(J. TY] = AOTP (1) 
(Ji, TÆ] = AV K(K + 1)- Q + NTS (2) 
[J-, TÆ] = AV K(K + 1) — Q(Q - NTH, (3) 


a. Show that a scalar operator is an irreducible tensor operator of rank K = 0, 
and that the three standard components of a vector operator (cf. exercise 7) are 
the components of an irreducible tensor operator of rank K = 1. 

b. Let [| k, J, M > } be a standard basis of common eigenstates of J? and 
J,. By taking both sides of (1) to have the same matrix elements between | k, J, M > 
and | k’, J’, M' >, show that < k, J, M | TY |k’, J’, M' > is zero if M is not equal 
to Q + M'. 

c. Proceeding in the same way with relations (2) and (3), show that the 
(2J + 1)(2K + 1)(27' + 1) matrix elements < k, J, M | TY | k', J’, M' > corres- 
ponding to fixed values of k, J, K, k’, J’ satisfy recurrence relations identical to 
those satisfied by the (2J + 1)(2K + 1)(27’ + 1) Clebsch-Gordan coefficients 
<J', K; M’, Q |J, M X (cf. complement By, $$ 1-c and 2) corresponding to fixed 
values of J, K, J’. 

d. Show that: 


<k, J, M| TE |k, J’, M =0(J',K;M',Q|J,M> (4) 


where « is a constant depending only on k, J, K, k*, J’, which is usually written in the 
form: 


a = —— C k, J | TO | kD 


V2J +1 


e. Show that, conversely, if (2K + 1) operators 7j*? satisfy relation (4) for 
all | k, J, M > and | k’, J’, M' >, they satisfy relations (1), (2) and (3), that is, they 
constitute the (2K + 1) components of an irreducible tensor operator of rank K. 

f. Show that, for a spinless particle, the electric multipole moment opera- 
tors Q7' introduced in complement E, are irreducible tensor operators of rank / 
in the state space &, of this particle. Show that, in addition, when the spin degrees 
of freedom are taken into account, the operators Q7 remain irreducible tensor 
operators in the state space 6, Y &, (where @, is the spin state space). 

g. Derive the selection rules satisfied by the Q in a standard basis 
{|k, l, J, M, > ) obtained by adding the orbital angular momentum L and the 
spin S of the particle to form the total angular momentum J = L + S [XI + 1)h?, 
J(J + 1)h?, M,h are the eigenvalues of L?, J?, J, respectively]. 


9. Let Af be an irreducible tensor operator (exercise 8) of rank K, acting in 
a state space &,, and Bf”), an irreducible tensor operator of rank K, acting in 
a state space &,, With Aj." and Bf”, we construct the operator : 


cy = AS @ BENS = » < Ki, K,3Q;,Q, | K,Q > AG? BS? 
0102 
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a. Using the recurrence relations for Clebsch-Gordan coefficients (cf. com- 
plement B,), show that the C$ satisfy commutation relations (1), (2) and (3) of 
exercise 8 with the total angular momentum J = J, + J, of the system. Show that 
the CHP are the components of an irreducible tensor operator of rank K. 


b. Show that the operator Y (— 1)245?B%) is a scalar operator (you may 


Q 
use the results of $3-d of complement By). 


10. Addition of three angular momenta 


Let &(1), &(2), €(3) be the state spaces of three systems, (1), (2) and (3), of 
angular momenta J,, J,, J,. We shall write J = J, + J, + J, for the total angular 
momentum. Let { | ka Ja Ma) y {| Kus dy Mp >}, {| Kes Jo Me > } be the standard 
bases of &(1), &(2), &(3), respectively. To simplify the notation, we shall omit the 
indices k,, k,, Ke as we did in chapter X. 

We are interested in the eigenstates and eigenvalues of the total angular 
momentum in the subspace &(/,, j,, JL) subtended by the kets: 


{ | jm, > | jm, > | jun. > } 
— Ja S Ma Sje — Jy S My Sys — Je SM Sje (1) 


We want to add j,, j,, j, to form an eigenstate of J? and J, characterized by the 
quantum numbers j, and m,. We shall denote by: 


| Jai Unido: jm y (2) 


such a normalized eigenstate obtained by first adding j, to j, to form an angular 
momentum j,, then adding j, to j, to form the state |j ¡mM y >. One could also add j, 
and j, to form jg and then add j, to j, to form the normalized state | jpm; >, 
written : 


| Gado digs des Spy > (3) 

a. Show that the system of kets (2), corresponding to the various possible 
values of j,, jp, My, forms an orthonormal basis in &(/,, js Je). Same question for the 
system of kets (3), corresponding to the various values of j,, jp, my. 

b. Show, by using the operators J,, that the scalar product 
CU Jj Spy PANAS ; jm, > does not depend on m,, denoting such a scalar 
product by < (Jaji jo» Jedy | Jas je Sy >. 

c. Show that: 


| jas Unides jm; > = 2 < VAN des Jp lias Unides dy > | Vania. J My > (4) 
Jg 
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d. Using the Clebsch-Gordan coefficients, write the expansions for vectors (2) 
and (3) in the basis (1). Show that : 


2 < Jide: Mps Me | jo Me > ida Mas Me AZ > = 
Me 


2 ld Ma» Mp lj m, > Cilek: m,, Me |j, my > 


JgMg 


x< (isda testy PANA MAA > (5) 


e. Starting with relation (5), prove, using the Clebsch-Gordan coefficient 
orthogonality relations, the following relations : 


2, < jije: Mp, Me | je Me > jasle Mas Me |j, my > < ja Ma | jade: Mas Mp > 


MaMp Me 
= Calo My, Mo Lis. My > < Cialniardes iy | jas Une iy » (6) 
as well as: 


C Cisisdiardesdy Liar Unidiep >= 5G < josje! Mp, Me | jes Me > 
iy 


MaM MMMM $ 


x C jaje Mas Me Lips My > < ja my |Jaja: Mas M, > <jr My loje; Ma m.> (7) 


Exercises 8 and 9: 


References : see references of complement Dy 


Exercise 10: 


References: Edmonds (2.21), chap. 6; Messiah (1.17), $XII1-29 and App. C; 
Rose (2.19), App. I. 
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The quantum mechanical study of conservative physical systems (that is, systems 
whose Hamiltonians are not explicitly time-dependent) is based on the eigenvalue 
equation of the Hamiltonian operator. We have already encountered two important 
examples of physical systems (the harmonic oscillator and the hydrogen atom) 
whose Hamiltonians are simple enough for their eigenvalue equations to be solved 
exactly. However, this happens in only a very small number of problems. In 
general, the equation is too complicated for us to be able to find its solutions in an 
analytic form*. For example, we do not know how to treat many-electron atoms. 
even helium, exactly. Besides, the hydrogen atom theory explained in chap- 
ter VII (§C) takes into account only the electrostatic interaction between 
the proton and the electron; when relativistic corrections (such as magnetic forces) 
are added to this principal interaction, the equation obtained for the hydrogen 
atom can no longer be solved analytically. We must then resort to solving it 
numerically, frequently with a computer. There exist, however, approximation 
methods which enable us to obtain analytically approximate solutions of the 
basic eigenvalue equation in certain cases. In this chapter, we shall study one 
of these methods, known as “stationary perturbation theory”**. (In chapter XIII, 
we shall describe “time-dependent perturbation theory”, which is used to treat 
systems whose Hamiltonians contain explicitly time-dependent terms.) 

Stationary perturbation theory is very widely used in quantum physics, since 
it corresponds very well to the physicist's usual approach to problems. In studying 
a phenomenon or a physical system, one begins by isolating the principal effects 
which are responsible for the main features of this phenomenon or this system. 
When they have been understood. one tries to explain the “finer” details by taking 
into account less important effects that were neglected in the first approximation. 
It is in treating these secondary effects that one commonly uses perturbation 
theory. In chapter XII, we shall see, for example, the importance of perturbation 
theory in atomic physics: it will enable us to calculate the relativistic corrections 
in the case of the hydrogen atom. Similarly, complement By y, which is devoted to the 
helium atom, indicates how perturbation theory allows us to treat many-electron 
atoms. Numerous other applications of perturbation theory are given in the 
complements of this chapter and the following ones. 


* Of course, this phenomenon is not limited to the domain of quantum mechanics. In all fields 
of physics, there are very few problems which can be treated completely analytically. 

** Perturbation theory also exists in classical mechanics, where it is, in principle, entirely analogous 
to the one we shall describe here. 


1095 


www.elsolucionario.net 


CHAPTER XI STATIONARY PERTURBATION THEORY 


Let us mention, finally, another often used approximation method, the varia- 
tional method, which we shall present in complement E,,. We shall briefly examine 
its applications in solid state physics (complement F,,) and a molecular physics 
(complement Gx). 


A. DESCRIPTION OF THE METHOD 


1. Statement of the problem 


Perturbation theory is applicable when the Hamiltonian H of the system being 
studied can be put in the form: 


H=H,+W (A-1) 


where the eigenstates and eigenvalues of Ho are known, and where W is much 
smaller than H,. The operator H,, which is time-independent, is called the 
“unperturbed Hamiltonian” and W, the “perturbation”. If W is not time- 
dependent, we say that we are dealing with a “stationary perturbation” ; this is the 
case we are considering in this chapter (the case of time-dependent perturbations 
will be studied in chapter XIII). The problem is then to find the modifications 
produced in the energy levels of the system and in its stationary states by the 
addition of the perturbation W. 

When we say that W is much smaller than H,, this means that the matrix 
elements of W are much smaller than those of H,*. To make this more explicit, we 
shall assume that W is proportional to a real parameter 4 which is dimensionless and 
much smaller than |: 


W=i1W 
<1 (A-2) 


(where W is an operator whose matrix elements are comparable to those of Ho). 
Perturbation theory consists of expanding the eigenvalues and eigenstates of H in 
powers of 4, keeping only a finite number of terms (often only one or two) of these 
expansions. 

We shall assume the eigenstates and eigenvalues of the unperturbed 
Hamiltonian H, to be known. In addition, we shall assume, that the unperturbed 
energies form a discrete spectrum, and we shall label them by an integral index p: Ej. 
The corresponding eigenstates will be denoted by | Pp >, the additional index ¡ 
permitting us, in the case of a degenerate eigenvalue E°, to distinguish between 
the various vectors of an orthonormal basis of the associated eigensubspace. 
We therefore have: 


Hy |i) = E | 95> (A-3) 


* More precisely, the important point is that the matrix elements of W are much smaller than the 
differences between eigenvalues of H, (cf. comment of $ B-I-b). 
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where the set of vectors | o, > forms an orthonormal basis of the state space: 


< o, | TA > = O pp Ôi (A-4-a) 
22 10,><0p| =! (A-4-b) 


If we substitute (A-2) into (A-1), we can consider the Hamiltonian of the 
system to be continuously dependent on the parameter 4 characterizing the 
intensity of the perturbation: 


H(A) = Hy + AW (A-5) 


When å is equal to zero, H(A) is equal to the unperturbed Hamiltonian Ho. The 
eigenvalues E(A) of H(A) generally depend on A, and figure | represents possible 
forms of their variations with respect to 4. 

An eigenvector of H(A) is associated with each curve of figure 1. For a given 
value of A, these vectors form a basis of the state space | H(A) is an observable]. 
When 4 is much smaller than 1, the eigenvalues (A) and the eigenvectors | Y (4) > 
of H(A) remain very close to those of Hy = H(A = 0), which they approach 
when 4 —> 0. 


A E(A) 


0: 
E; a FIGURE 1 
Variation of the eigenvalues E(4) of the 
5 ! 7 Hamiltonian H(A) = H, + ŻW with res- 
E, pect to 2. Each curve corresponds to an 


| eigenstate of H(A). For 4 = 0, we obtain 

E == | the spectrum of Ho. We have assumed here 
Mi 30 that the eigenvalues £$ and E? are doubly 

i degenerate; application of the perturba- 

tion AW removes the degeneracy of E$, 


_ but not that of E?. An additional two-fold 
0 Ai A degeneracy appears for 4, = 4,. 


H(A) may, of course, have one or several degenerate eigenvalues. For example, 
in figure 1, the double curve represents a doubly degenerate energy (the one which 
approaches Eg when à —» 0), which corresponds, for all A, to a two-dimensional 
eigensubspace. It is also possible for several distinct eigenvalues E(4) to approach 
the same unperturbed energy Ef when à —=> 0* (this is the case for E$ in figure 1). 


* It is not impossible that additional degeneracies may appear for particular non-zero values of A 
(crossing at 4 = 4, in figure 1). We shall assume here that 4 is small enough to avoid such a situation. 
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We then say that the effect of the perturbation 1s to remove the degeneracy of the 
corresponding eigenvalue of Ho. 

In the following section, we shall give an approximate solution of the eigen- 
value equation of H(A) for 4 < 1. 


2. Approximate solution of the H(1) eigenvalue equation 


We are looking for the eigenstates | (A) > and eigenvalues E(A) of the 
Hermitian operator H(A): 


H(A) | w(2)> = EA) | y (A)> (A-6) 


We shall assume* that E(A) and | (A) > can be expanded in powers of A 
in the form: 


EA) = Ej + Ae, +. + Afe t (A-7-a) 
Ha) =|0> +4 11) +..+4|g)> +... (A-7-b) 


We then substitute these expansions, as well as definition (A-5) of H(A), into 
equation (A-6): 


(Ho + añ] S aja >| - | E | $, aJa >| (A-8) 


We require this equation to be satisfied for A small but arbitrary. We must 
therefore equate the coefficients of successive powers of A on both sides. Thus, we 
obtain: 

— for Oth-order terms in 4: 


H,|0> = €, |0> (A-9) 
— for 1st-order terms: 
(Ho — 80) 11) + (W- e,)]0> =0 (A-10) 
— for 2nd-order terms: 
(Ho — 60) 12) + (W-= e) ]1> — e, 10) =0 (A-11) 
— for gth-order terms: 


(Hy — £o) [a> + (W —#)|¢ -1> = e& |q- 2>... — e |0> =0 
(A-12) 


* This is not obvious from a mathematical point of view, the basic problem being the convergence 
of the series (A-7). 
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We shall confine ourselves here to the study of the first three equations, that is, 
we shall neglect, in expansions (A-7), terms of orders higher than 2 in 2. 
We know that the eigenvalue equation (A-6) defines | (A) > only to within 
a constant factor. We can therefore choose the norm of | yW(4)> and its phase: 
we shall require | yw (A) > to be normalized, and we shall choose its phase such that 


the scalar product < 0 | W(A) > is real. To Oth order, this implies that the vector 
denoted by | 0 > must be normalized: 


<0|0> =1 (A-13) 


Its phase, however, remains arbitrary ; we shall see in §§B and C how it can be 
chosen in each particular case. To Ist order, the square of the norm of | (A) > can 
be written: 


< WA) lyla =k 0|+4<1]][10> + 4/1>] + ol?) 
=(0|0)>+4[<1]/0> +< 0] 1>] + O(2?) (A-14) 


(where the symbol O(A?) signifies all the terms of order higher than or equal to p). 
Using (A-13), we see that this expression is equal to 1 to first order if the 4 term is 
zero. But the choice of phase indicates that the scalar product < 0 | 1 > is real (since å 
is real). We therefore obtain : 


<0|1> =<1]0> =0 (A-15) 
An analogous argument, for 2nd order in À, yields: 


(0]2) =<2]0)=-=3€1]1) (A-16) 


and, for qth order: 
<0|q> =<4|0> 
-Kq=111> t+{q—-2|2> +. 
+<2|qa—2>+¢1|q—1)] (A-17) 


When we confine ourselves to second order in A, the perturbation equations are 
therefore (A-9), (A 10) and (A-11). With the conventions we have set, we must 
add conditions (A-13), (A-15) and (A-16). 

Equation (A-9) expresses the fact that | 0 > is an eigenvector of H, with the 
eigenvalue £o. &, therefore belongs to the spectrum of Hy. This was to be expected, 
since each eigenvalue of H(A), when 4 —= 0, approaches one of the unperturbed 
energies. We then choose a particular value of gp, that is, an eigenvalue £? 
of H. As figure 1 shows, there can exist one or several different energies E(A) 
of H(A) which approach E? when 4 — 0. 

Consider the set of eigenstates of H(A) corresponding to the various 
eigenvalues E(4) which approach E? when 4 —» 0. They span a vector subspace 
whose dimension clearly cannot vary discontinuously when å varies in the 
neighborhood of zero. This dimension is consequently equal to the degeneracy g, 
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of £?. In particular, if E? is non-degenerate, it can give rise only to a single 
energy E(2), and this energy is non-degenerate. 

To study the influence of the perturbation W, we shall consider separately 
the case of non-degenerate, and degenerate levels of H,. 


B. PERTURBATION OF A NON-DEGENERATE LEVEL 


Consider a particular non-degenerate eigenvalue E? of the unperturbed 
Hamiltonian H,. Associated with it is an eigenvector | p, > which is unique to within 
a constant factor. We want to determine the modifications in this unperturbed 
energy and in the corresponding stationary state produced by the addition of the 
perturbation W to the Hamiltonian. 

To do so, we shall use perturbation equations (A-9) through (A-12), as well as 
conditions (A-13) and (A-15) through (A-17). For the eigenvalue of H(A) which 
approaches E? when 4 —> 0, we have: 


£o = ES (B-1) 
which, according to (A-9), implies that |0 > must be proportional to | p, >. The 
vectors | 0 > and | g, > are both normalized [cf. (A-13)], and we shall choose: 

[0> = |9,> (B-2) 


Thus, when å —> 0, we again find the unperturbed state | p, > with the same 
phase. 

We call E, (4) the eigenvalue of H(A) which, when 4 —=> 0, approaches the 
eigenvalue E? of H,. We shall assume å small enough for this eigenvalue to remain 
non-degenerate, that is, for a unique eigenvector | W,(4) > to correspond to it (in the 
case of the n = 2 level of figure 1, this is satisfied if A < 4, ). We shall now calculate 
the first terms of the expansion of E,(A) and | W,(4) > in powers of 2. 


1. First-order corrections 


We shall begin by determining s, and the vector | 1 > from equation (A-10) 
and condition (A-15). 


a. ENERGY CORRECTION 

Projecting equation (A-10) onto the vector | y, >. we obtain: 

< P, | (Ho — e) 1> +<0, |(W- 2,)[0> = 0 (B-3) 
The first term is zero, since |g, > = |0)> is an eigenvector of the Hermitian 


operator H, with the eigenvalue E? = e,. With (B-2) taken into account, equa- 
tion (B-3) then yields: 


e =< P| WO =<o,|W]o,> (B-4) 
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In the case of a non-degenerate state Ef, the eigenvalue E, (2) of H which 
corresponds to E? can be written, to first order in the perturbation W = AW: 


| EA) = £2 +<p,|W] Q,> +0(42) | (B-5) 


The first-order correction to a non-degenerate energy E? is simply equal to the mean 
value of the perturbation term W in the unperturbed state | p, >. 


b. EIGENVECTOR CORRECTION 


The projection (B-3) obviously does not exhaust all the information 
contained in perturbation equation (A-10). We must now project this equation onto 
all the vectors of the { | p; > } basis other than | p, >. We obtain, using (B-1) 
and (B-2): 


(gi | (Ho — ED 1> +< W—e)lo,>=0 (p#n) (B-6) 


(since the eigenvalues Ej other than E® can be degenerate, we must retain the 
degeneracy index i here). Since the eigenvectors of H, associated with different 
eigenvalues are orthogonal, the last term, e, < pi | Ø, >. is zero. Furthermore, in the 
first term, we can let H, act on the left on < gi} |. (B-6) then becomes : 


(E — E) Coill + ei |Wlo,> =0 (B-7) 


which gives the coefficients of the desired expansion of the vector | 1 > on all the 
unperturbed basis states, except | p, >: 


Co, li> = TALAKA (p #n) (B-8) 


E? — El 
The last coefficient which we lack, < pp, | 1 >, is actually zero, according to condi- 
tion (A-15), which we have not yet used [| ø, >,according to (B-2), coincides 
with |0>]: 

Co 1 =0 (B-9) 


We therefore know the vector | 1 > since we know its expansion on the {| pi > } 
basis: 


i W : 


Consequently, to first order in the perturbation W = AW, the eigenvector 
| W,(2) > of H corresponding to the unperturbed state | o, > can be written: 


ud) = 10.) + E SACL a? og) + 064) (B-11) 
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The first-order correction of the state vector is a linear superposition of all the 
unperturbed states other than |g, > : the perturbation W is said to produce a 
“mixing” of the state | Pn > with the other eigenstates of H,. The contribution 
of a given state lo; > is zero if the perturbation W has no matrix element 
between | p, > and [oe > In general, the stronger the coupling induced by W 
between | y, > and |) > (characterized by the matrix element < p$ | W | 9, >). i 
the closer the level E> to ne level E? under study, the greater the mixing with | gi, > 


COMMENT: 


We have assumed that the perturbation W is much smaller than the 
unperturbed Hamiltonian Ho, that is, that the matrix elements of W are 
much smaller than those of H,. It appears here that this hypothesis is not 
sufficient : the first order correction of the state vector is small only if the 
non-diagonal matrix elements of W are much smaller than the corresponding 
unperturbed energy differences. 


2. Second-order corrections 


The second-order corrections can be extracted from perturbation equa- 
tion (A-11) by the same method as above, with the addition of condition (A-16). 


a. ENERGY CORRECTION 


To calculate £,, we project equation (A-11) onto the vector | y, >, using (B-1) 
and (B-2): 


< Pa| (Ho — £2)|2> +e, (W — e,)] 1>) -e<o,lo,» =0 (B-12) 


For the same reason as in § B-l-a, the first term is zero. This is also the case 
for £; < p, | 1 >, since, according to (B-9), | 1 > is orthogonal to | g, >. We then get : 


that is, substituting expression (B-10) for the vector | | > 


yl p, | w Pn? (B-14) 


22 i ES = E? 


This result enables us to write the energy E, (4), to second order in the pertur- 
bation W = AW, in the form: 


E) = E+ o, Wie.) + Z RIM o Bas 
pan i Es = E; 
COMMENT: 


The second-order energy correction for the state | „> due to the 
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presence of the state | p; > has the sign of E? — Ef. We can therefore say 
that, to second order, the closer the state | pf > to the state | @, >, and the 
stronger the “coupling” |< pi | W | p, > |, the more these two levels “repel” 
each other. 


b. EIGENVECTOR CORRECTION 


By projecting equation (A-11) onto the set of basis vectors | p; > different 
from |g, >, and by using conditions (A-16), we could obtain the expression for 
the ket | 2 >, and therefore the eigenvector to second order. Such a calculation pre- 
sents no theoretical difficulties, and we shall not give it here. 


COMMENT: 


In (B-4), the first-order energy correction is expressed in terms of the 
zeroeth-order eigenvector. Similarly, in (B-13), the second-order energy 
correction involves the first-order eigenvector [which explains a certain 
similarity of formulas (B-10) and (B-14)]. This is a general result : by pro- 
jecting (A-12) onto | p, >, one makes the first term go to zero, which gives £, 
in terms of the corrections of order q — 1, q — 2, ... Of the eigenvector. This 
is why we generally retain one more term in the energy expansion than in 
that of the eigenvector : for example, the energy is given to second order 
and the eigenvector to first order. 


c. UPPER LIMIT OF £., 


If we limit the energy expansion to first order in 4. we can obtain an approxi- 
mate idea of the error involved by evaluating the second-order term which is simple 
to obtain. 

Consider expression (B-14) for ¢,. It contains a sum (which is generally 
infinite) of terms whose numerators are positive or zero. We denote by AE the 
absolute value of the difference between the energy E? of the level being studied 
and that of the closest level. For all n, we obviously have: 


|E? — En | > AE (B-16) 
This gives us an upper limit for the absolute value of g, : 
l nEn: 
ll <zp 2 Ilol W ol (B-17) 
pee í 


which can be written : 
1 de ka 
leo] SGE ELW |o W 1 Pn > 
pn i 
1 A f š 
APACE (B-18) 


pen i 
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The operator which appears inside the brackets differs from the identity operator 
only by the projector onto the state | p, >, since the basis of unperturbed states 
satisfies the closure relation: 


Lo, ><on | + Y lo < ob ]=1 (B-19) 


pén i 


Inequality (B-18) therefore becomes simply : 
1 n A 
leal < Jg l On ¡WI - le, >< Onl IW] 9, > 
1 P a , 
Sap LK On| WH? 1 n> — K On| W | Pn >] (B-20) 


Multiplying both sides of (B-20) by 4*, we obtain an upper limit for the 
second-order term in the expansion of E (å), in the form: 


e < — (AW)? (B-21) 


where AW is the root-mean-square deviation of the perturbation W in the unper- 
turbed state | p, >. This indicates the order of magnitude of the error committed by 
taking only the first-order correction into account. 


C. PERTURBATION OF A DEGENERATE STATE 


Now assume that the level E? whose perturbation we want to study is g,-fold 
degenerate (where g, is greater than 1, but finite). We denote by &° the corresponding 
eigensubspace of H,. In this case, the choice: 


£o ee E, (C-1) 


does not suffice to determine the vector | 0 >, since equation (A-9) can theoretically 
be satisfied by any linear combination of the g, vectors |g) > (i = 1. 2, .... gn). We 
know only that | 0 > belongs to the eigensubspace &° spanned by them. 

We shall see that, this time, under the action of the perturbation W, the 
level Ef generally gives rise to several distinct “sublevels”. Their number, /,, is 
between | and g,. If /, is less than g,, some of these sublevels are degenerate, since 
the total number of orthogonal eigenvectors of H associated with the /, sublevels 
is always equal to g,. To calculate the eigenvalues and eigenstates of the total Hamil- 
tonian H, we shall limit ourselves, as usual, to first order in å for the energies and 
to zeroeth order for the eigenvectors. 

To determine £, and | 0 >, we can project equation (A-10) onto the y, basis 
vectors | ø} >. Since the | pf > are eigenvectors of Ho with the eigenvalue E? = to, 
we Obtain the y, relations: 


(o |W|0> =, <p 0) (C-2) 
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We now insert. between the operator W and the vector | 0 >, the closure relation 
for the [| gi, > | basis : 


LE <a IW |g ><e|0> = 4 CALD (C-3) 
p Ë 
The vector | 0 >, which belongs to the eigensubspace associated with Ef, is orthogonal 
to all the basis vectors | œ, > for which p is different from ». Consequently, on the 
left-hand side of (C-3). the sum over the index p reduces to a single term (p = n). 
which gives : 


Un 


Y a loco |0> = 2, CaO (C-4) 


isl 


We arrange the gy? numbers < |W |o? > (where n is fixed and 
Li = 1,2, ga) Mag, X Y, Matrix of row index / and column index i’. This 
square matrix, which we shall denote by (HW) is, so to speak, cut out of the matrix 
which represents W in the { | pi > } basis : (W is the part which corresponds 
to 6. Equations (C-4) then show that the column vector of elements < oi | 0 > 
(i = 1, 2, ..., ga) is an eigenvector of (wer) with the eigenvalue ¢,. 

System (C-4) can, moreover, be transformed into a vector equation inside 6°. 
All we need to do is define the operator W the restriction of W to the 
subspace 82. W"” acts only in 87, and it is represented in this subspace by the 
matrix of elements < ¢; | W | pi’ >, thatis, by (W"y*. System (C-4) is thus equivalent 
to the vector equation: 


Ww" |0> = 2, |0> (C-5) 


[ We stress the fact that the operator W*" is different from the operator W of which 
it is the restriction: equation (C-5) is an eigenvalue equation inside 6°, and not 
in all space ]. 

Therefore, to calculate the eigenvalues (to first order) and the eigenstates 
(to zeroeth order) of the Hamiltonian corresponding to a degenerate unperturbed 
state E°, diagonalize the matrix (W"), which represents the perturbation** W, 
inside the eigensubspuce 82 associated with E?. 

Let us examine more closely the first-order effect of the perturbation W on 
the degenerate state E°. Let ef (j = 1, 2, .... 1.0?) be the various distinct roots of the 
characteristic equation of (W("). Since (W") is Hermitian, its eigenvalues are all 
real, and the sum of their degrees of degeneracy is equal to g, (g, > S). 
Each eigenvalue introduces a different energy correction. Therefore, under the 
influence of the perturbation W = ZW’, the degenerate level splits, to first order. 
into £ 0? distinct sublevels. whose energies can be written : 


7 70 Zal Fun d) 
E, (4) = Ey + 26; PEM |" Se (C-6) 
* If P, is the projector onto the subspace 6°, 17" can be written (complement B,,. $ 3): 
we = PFP, 


** (W"") is simply equal to AUD); this is why its eigenvalues yield directly the corrections hey. 
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If FW = g,, we say that, to first order, the perturbation W completely removes the 
degeneracy of the level E?. If fP < g,, the degeneracy, to first order, is only par- 
tially removed (or not at all if fP = 1). 


We shall now choose an eigenvalue s} of W"”, If this eigenvalue is non-degen- 
erate, the corresponding eigenvector | 0 > is uniquely determined (to within a phase 
factor) by (C-5) [or by the equivalent system (C-4)]. There then exists a single 
eigenvalue E(A) of H(A) which is equal to Ef + Aef, to first order, and this 
eigenvalue is non-degenerate*. On the other hand, if the eigenvalue el of WI” 
being considered presents a g-fold degeneracy, (C-5) indicates only that | 0 > belongs 
to the corresponding q-dimensional subspace ¥'$'. 


This property of e, can, actually, reflect two very different situations. One could distinguish 
between them by pursuing the perturbation calculation to higher orders of A, and seeing whether 
the remaining degeneracy is removed. These two situations are the following : 

(i) Suppose that there is only one exact energy E(A) which is equal, to first order, to 
E? + Asf, and that this energy is g-fold degenerate [in figure 1, for example, the energy E(A) 
which approaches E$ when ¿ —= 0 is two-fold degenerate, for any value of 4]. A g-dimensional 
eigensubspace then corresponds to the eigenvalue E(1), whatever 4, so that the degeneracy of the 
approximate eigenvalues will never be removed, to any order of A. 

In this case, the zeroeth-order eigenvector |0 > of H(A) cannot be completely specified, 
since the only condition imposed on | 0 > is that of belonging to a subspace which is the limit, 
when A —> 0, of the q-dimensional eigensubspace of H(A) corresponding to E(4). This limit is 
none other than the eigensubspace 4(* of (W™) associated with the eigenvalue e chosen. 

This first case often arises when H, and W possess common symmetry properties, implying 
an essential degeneracy for H(A). Such a degeneracy then remains to all orders in perturbation 
theory. 

(ii) It may also happen that several different energies E(A) are equal, to first order, to 
E? + dei (the difference between these energies then appears in a calculation at second or higher 
orders). In this case, the subspace F obtained to first order is only the direct sum of the limits, 
for 2 —> 0, of several eigensubspaces associated with these various energies E(A). In other 
words, all the eigenvectors of H(A) corresponding to these energies certainly approach kets 
of ¥$"), but, inversely, a particular ket of #{ is not necessarily the limit | 0 > of an eigenket 
of H(A). 

In this situation, going to higher order terms allows one, not only to improve the accuracy 
of the energies, but also to determine the zeroeth-order kets | 0 >. However, in practice, the partial 
information contained in equation (C-5) is often considered sufficient. 


COMMENTS: 


(i) When we use the perturbation method to treat all the energies** of the spec- 
trum of Ho, we must diagonalize the perturbation W inside each of the 
eigensubspaces &° corresponding to these energies. It must be understood 
that this problem is much simpler than the initial problem, which is the 
complete diagonalization of the Hamiltonian in the entire state space. Pertur- 
bation theory enables us to ignore completely the matrix elements of W between 
vectors belonging to different subspaces &°. Therefore, instead of having to 


* The proof of this point is analogous to the one which shows that a non-degenerate level of H, 
gives rise to a non-degenerate level of H(A) (cf. end of § A-2). 

** The perturbation of a non-degenerate state, studied in $B, can be seen as a special case of that 
of a degenerate state. 


1106 


(ii) 


(iii) 


www.elsolucionario.net 
C. PERTURBATION OF A DEGENERATE LEVEL 


diagonalize a generally infinite matrix, we need only diagonalize, for each of 
the energies E? in which we are interested, a matrix of smaller dimensions, 
generally finite. 


The matrix (W'”) clearly depends on the | | gi >! basis initially chosen 
in this subspace &° (although the eigenvalues and eigenkets of W"! obviously 
do not depend on it). Therefore, before we begin the perturbation calculation, 
it is advantageous to find a basis which simplifies as much as possible the 
form of (W'") for this subspace, and, consequently, the search for its 
eigenvalues and eigenvectors (the simplest situation is obviously the one in 
which this matrix is obtained directly in a diagonal form). To find such a basis. 
we often use observables which commute both with H, and W*. Assume 
that we have an observable A which commutes with H, and W. Since H, 
and A commute, we can choose for the basis vectors | gi > eigenstates common 
to Ho and A. Furthermore, since W commutes with A, its matrix elements 
are zero between eigenvectors of A associated with different eigenvalues. 
The matrix (W1”) then contains numerous zeros, which facilitates its diago- 
nalization. 


Just as for non-degenerate levels (cf. comment of $ B-1-b), the method described 
in this section is valid only if the matrix elements of the perturbation W 
are much smaller than the differences between the energy of the level under 
study and those of the other levels (this conclusion would have been evident 
if we had calculated higher-order corrections). However, it is possible to extend 
this method to the case of a group of unperturbed levels which are very close 
to each other (but distinct) and very far from all the other levels of the system 
being considered. This means, of course, that the matrix elements of the 
perturbation W are of the same order of magnitude as the energy differences 
inside the group, but are negligible compared to the separation between 
a level in the group and one outside. We can then approximately determine 
the influence of the perturbation W by diagonalizing the matrix which 
represents H = Ho + W inside this group of levels. It is by relying on an 
approximation of this type that we can, in certain cases, reduce the study 
of a physical problem to that of a two-level system, such as those described 
in chapter IV (3C). 


References and suggestions for further reading: 


For other perturbation methods, see, for example: 
Brillouin-Wigner series (an expansion which is simple for all orders but which involves 
the perturbed energies in the energy denominators): Ziman (2.26), $3.1. 
The resolvent method (an operator method which is well suited to the calculation of 
higher-order corrections): Messiah (1.17), chap. XVI, §IIL; Roman (2.3), §4-5-d. 
Method of Dalgarno and Lewis (which replaces the summations over the intermediate 
states by differential equations): Borowitz (1.7).§ 14-5; Schiff (1.18), chap. 8, $33. Original 
references : (2.34), (2.35), (2.36). 


* Recall that this does not imply that Ha and W commute. 
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The W.K.B. method, applicable to quasi-classical situations : Landau and Lifshitz 
(1.19), chap. 7; Messiah (1.17), chap. VI, $1I; Merzbacher (1.16), chap. VIT; Schiff (1.18), 
$34; Borowitz (1.7), chaps. 8 and 9. 

The Hartree and Hartree-Fock methods: Messiah (1.17), chap. XVIII, §H; 
Slater (11.8), chap. 8 and 9 (Hartree) and 17 (Hartree-Fock); Bethe and Jackiw (1.21), 
chap. 4. See also references of complement Ax yy. 
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COMPLEMENTS OF CHAPTER Xi 


Ax: 


Bx: 


Cy: 


Du: 


Eu: 


Fu: 


Gr: 


Hy: 


A ONE-DIMENSIONAL HARMONIC 
OSCILLATOR SUBJECTED 

TO A PERTURBING POTENTIAL 
IN x, x7, x? 


INTERACTION 
BETWEEN THE MAGNETIC DIPOLES 
OF TWO SPIN 1/2 PARTICLES 


VAN DER WAALS FORCES 


THE VOLUME EFFECT: THE INFLUENCE 
OF THE SPATIAL EXTENSION OF THE NUCLEUS 
ON THE ATOMIC LEVELS 


THE VARIATIONAL METHOD 


ENERGY BANDS OF ELECTRONS 
IN SOLIDS : A SIMPLE MODEL 


A SIMPLE EXAMPLE OF THE CHEMICAL BOND : 
THE H ION 


EXERCISES 


Ax, Bxp Cy, Dy,: illustrations of stationary 
perturbation theory using simple and important 
examples. 

Ax: study of a one-dimensional harmonic 
oscillator perturbed by a potential in x. 1%, x°. 
Simple, advised for a first reading. The last 
example (perturbing potential in x?) peritos ine 
study of the anharmonicity in the vibra: l 
a diatomic molecule (a refinemen: or Un meds 
presented in complement Ay). 

Bx,: can be considered as a worked example. 
illustrating perturbation theory for non-degen- 
erate as well as degenerate states. Familiarizos 
the reader with the dipole-dipole interaction 
between magnetic moments of two spin 1/2 
particles. Simple. 

Cx: study of the long-distance forces between 
two neutral atoms using perturbation theory 
(Van der Waals forces). The accent is placed on 
the physical interpretation of the results. A little 
less simple than the two preceding complements: 
can be reserved for later study. 

Dx: study of the influence of the nuclear 
volume on the energy levels of hydrogen-like 
atoms, Simple. Can be considered as a sequel of 
complement Ayi- 


Eç: presentation of another approximation 
method, the variational method. Important, since 
the applications of the variational method are 
very numerous. 


Ex. Gy: two important applications of the 
variational method. 

Fx: introduction, using the strong-bonding 
approximation, of the concept of an allowed 
energy band for the electrons of a solid. Essential, 
because of its numerous applications. Moderately 
difficult. The accent is placed on the interpreta- 
tion of the results. The view point adopted is 
different from that of complement O,, and 
somewhat simpler. 

Gx: studies the phenomenon «+: 
bond for the simplest possible case. thai ol the 
(ionized) H} molecule. Shows hew quanti 
mechanics explains the attractive forces bets een 
two atoms whose electronic wave functions over- 


clio tal 


lap. Includes a proof of the nial theorom. 
Essential from the point of view of chemical 


physics. Moderately difficult. 
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Complement Axı 


A ONE-DIMENSIONAL HARMONIC OSCILLATOR SUBJECTED 
TO A PERTURBING POTENTIAL IN x, x?, x? 


l. Perturbation by a linear potential 


a. The exact solution 
b. The perturbation expansion 


2. Perturbation by a quadratic potential 
3. Perturbation by a potential in x* 


a. The anharmonic oscillator 
b. The perturbation expansion 
c. Application : the anharmonicity of the vibrations of a diatomic molecule 


In order to illustrate the general considerations of chapter XI by a simple example. 
we shall use stationary perturbation theory to study the effect of a perturbing 
potential in x, x? or x° on the energy levels of a one-dimensional harmonic 
oscillator (none of these levels is degenerate, cf. chap. V). 

The first two cases (a perturbing potential in x and in x”) are exactly soluble. 
Consequently, we shall be able to verify in these two examples that the perturbation 
expansion coincides with the limited expansion of the exact solution with respect 
to the parameter which characterizes the strength of the perturbation. The last case 
(a perturbing potential in x*) is very important in practice for the following reason. 
Consider a potential V(x) which has a minimum at x = 0. To a first approximation, 
V(x) can be replaced by the first term (in x7) of its Taylor series expansion, in which 
case we are considering a harmonic oscillator and, therefore, an exactly soluble 
problem. The next term of the expansion of V(x), which is proportional to x°, then 
constitutes the first correction to this approximation. Calculation of the effect of the 
term in x*, consequently, is necessary whenever we want to study the anharmonicity 
of the vibrations of a physical system. It permits us, for example, to evaluate the 
deviations of the vibrational spectrum of diatomic molecules from the predictions 
of the (purely harmonic) model of complement Ay. 


1. Perturbation by a linear potential 


We shall use the notation of chapter V. Let: 
2 


1 
Ho = 5p t3" X? (1) 


be the Hamiltonian of a one-dimensional harmonic oscillator of eigenvectors [pn 
and cigenvalues* : 


* To specify that it is the unperturbed Hamiltonian that we are considering. we add. as in 
chapter XI, the index 0 to the eigenvalue of Ho. 
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E? = ¢ +3) (2) 


with n = 0, 1,2... 
We add to this Hamiltonian the perturbation: 


W = ¿ho X (3) 


where 4 is a real dimensionless constant much smaller than 1, and X is given by 
formula (B-1) of chapter V (since X is of the order of 1, RhwX is of the order of Ho 
and plays the role of the operator W of chapter XI). The problem consists of finding 
the eigenstates | y, > and eigenvalues E, of the Hamiltonian: 


H=H,+W (4) 


a. THE EXACT SOLUTION 


We have already studied an example of a linear perturbation in X : when the 
oscillator, assumed to be charged, ts placed in a uniform electric field $, we must 
add to H, the electrostatic Hamiltonian: 
h 
— X 


A eer (5) 


mo) 


where q is the charge of the oscillator. The effect of such a term on the stationary 
states of the harmonic oscillator was studied in detail in complement F,. It is 
therefore possible to use the results of this complement to determine the eigenstates 
and eigenvalues of the Hamiltonian H given by (4) if we perform the substitution: 


¿hw <> — qé | 


mo (6) 
Expression (39) of F, thus yields immediately: 
1 2 
E, = ¢ +5) — she (7) 


Similarly, we see from (40) of F, (after having replaced P by its expression in terms 
of the creation and annihilation operators w' and a): 


ase (5 (8) 


The limited expansion of the exponential then vields: 


> =| ee eee) + [len 
V2 
atil | 
: +1 E 

= (Pn) — ¿A Parr? NILA a (9) 
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b. THE PERTURBATION EXPANSION 


We replace X by ee + a) in (3) [cf. formula (B-7-a) of chapter V]. 
2 
We obtain: 


Wes =e) (10) 


ia 


W then mixes the state | py, > only with the two states | y,,, > and | y,_, >. The 
only non-zero matrix elements of W are, consequently : 


TORE 
In +1 
Ww SR 
< Pnr |W | e,> a 
7 
E li (11) 


According to general expression (B-15) of chapter XI, we have: 


|W K HE 
= El + W + [Cos] W | en >? Pn | n +. 12 
n= En +<0 1 Wlo,> PA E0 — Eo (12) 


Substituting (11) into (12) and replacing E? — Ef, by (n — n')ho, we immediately 
obtain: 


2 2 
ps +i ho +. 
1 2 
= (r +5) -Fto Pan (13) 


This shows that the perturbation expansion of the eigenvalue to second order in 2 
coincides* with the exact solution (7). 
Similarly, general formula (B-11) of chapter XI: 


ur 10) + E LO o) + (14) 


yields here: 


mn +i in 
NE (15) 


an expression which is identical to expansion (9) of the exact solution. 


* Tt can be shown that all terms of order higher than 2 in the perturbation expansion are zero. 
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2. Perturbation by a quadratic potential 


We now assume W to have the following form: 


— 


W =—phoX? = 5 pmo? X? (16) 


2 
where p is a real dimensionless parameter much smaller than 1. H can then be 
written : 


P? 1 2 2 
= Ze + 17 
H=H +W > 5 MO (1 + p)X (17) 


In this case, the effect of the perturbation is simply to change the spring constant 
of the harmonic oscillator. If we set: 


wm? = w*(1 + p) (18) 


we see that H is still a harmonic oscillator Hamiltonian, whose angular frequency 
has become w’. 

In this section, we shall confine ourselves to the study of the eigenvalues 
of H. According to (17) and (18), they can be written simply : 


E, = (n jw = (» +3)ño fl + p (19) 


that is, expanding the radical: 


ae 22 
E, = (r +3) 1 ler + .| (20) 


Let us now find result (20) by using stationary perturbation theory. 
Expression (16) can also be written: 


1 
pho(a? + a? + aa’ + aa) 


_! TF cae 
W = 7p hola + a) =] 


= 19 ho[a*? +a? +24a +1] (21) 


From this, it can be seen that the only non-zero matrix elements of W associated 
with | p, > are: 


1 l 
(On| Wo, > =3e(» +3) 


(or. 1W|0,> =F oln + In +23 ño (22) 


l 
Co 1 W [o> = Zola — 112 ño 
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When we use this result to evaluate the various terms of (12), we find: 


A PA oe Le ame O E 
b= Es sgh +5) 160 + I)(n + 2) 5 + TALU 1) 5 Pia 
l p l p? 
— F° Mot- -Miao 
= E, t(n +3) 5 (» +5 }he $ aa 
_ | p P 
= (» + spoli +5 3 + | (23) 


which indeed coincides with expansion (20). 


3. Perturbation by a potential in x* 
We now add to H, the perturbation: 


W = cha X> (24) 


where o is a real dimensionless number much smaller than 1. 


THE ANHARMONIC OSCILLATOR 


a. 
Figure | represents the variation with respect to x of the total potential 


2 max? + W(x) in which the particle is moving. The dashed line gives the parabolic 
potential 1 max? of the “unperturbed ” harmonic oscillator. We have chosen a < 0, 


so that the total potential (the solid curve in the figure) increases less rapidly 


for x > Othan for x < 0. 


1 
max + W(x) 


A 


FIGURE 1 

Variation of the potential associated with an anharmonic oscillator with respect to x. We treat the 
difference between the real potential (solid line) and the harmonic potential (dashed line) of the 
unperturbed Hamiltonian as a perturbation (x, and x, are the limits of the classical motion of 


energy £). 
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When the problem is treated in classical mechanics, the particle with total 
energy E is found to oscillate between two points, x4 and x, (fig. 1), which are no 
longer symmetrical with respect to O. This motion, while it remains periodic, is no 
longer sinusoidal: there appears, in the Fourier series expansion of x(t), a whole 
series of harmonics of the fundamental frequency. This is why such a system is 
called an “anharmonic oscillator” (its motion is no longer harmonic). Finally, let 
us point out that the period of the motion is no longer independent of the energy E, 
as was the case for the harmonic oscillator. 


b. THE PERTURBATION EXPANSION 


X. Matrix elements of the perturbation W 
a I : l 
We replace A' by a + a) in (24). Using relations (B-9) and (B-17) of 
2 


chapter V, we obtain, after a simple calculation: 


W = To [at +4 +3Nat +3(N + 1)a] (25) 


where N = a'a was defined in chapter V [formula (B-13)]. 
From this can immediately be deduced the only non-zero matrix elements 
of W associated with | y, >: 


1 
<On+3 | Ww | On > = q taet = 3)(n i E P no 
| 22 
< Pn- Wl|o,>= o| eh 
3 
COn |W |e, > = 3o(” = ~) 
3 
n\2 
@,-1|W]e,> = s0(5) ho (26) 


B. Calculation of the energies 


We substitute results (26) into the perturbation expansion of E, (formula 12). 
Since the diagonal element of W is zero, there is no first-order correction. The four 
matrix elemenis (26) enter, however, into the second-order correction. A simple 
calculation thus yields : 


7 PIE y a 
EE ¢ +5) -7I (» +3) ho — 6° ho +... (27) 
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The effect of W is therefore to lower the levels (whatever the sign of a). The 
larger n, the greater the shift (fig. 2). The difference between two adjacent levels 
is equal to: 


E, — Era = ol 1 — 5 en] (28) 


It is no longer independent of n, as it was for the harmonic oscillator. The energy 
states are no longer equidistant and move closer together as n increases. 


n+2 
a aaa a 
o 
See es 
a A 
n— l 
E AR ee ~ FIGURE 2 
Energy levels of H, (dashed lines) and of H (solid 
lines). Under the effect of the perturbation W, each 
n-2 level of H, is lowered, and the higher n, the greater 
Sea AA the shift. 
y. Calculation of the eigenstates 


Substituting relations (26) into expansion (14), we easily obtain: 


2 


a 
|. > =|, > — s0(" s+) ese +30(5) lea 


Sle + 3)(n + Kn + 27 bs 


1 


Pepa.) + o» 


Under the effect of the perturbation W, the state | y, > is therefore mixed with the 
states | a+ 1 >, PỌn-1 ES Pn+3 > and | Pn-3 >. 
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c. APPLICATION : THE ANHARMONICITY OF THE VIBRATIONS 
OF A DIATOMIC MOLECULE 


In complement Ay, we showed that a heteropolar diatomic molecule could 
absorb or emit electromagnetic waves whose frequency coincides with the vibrational 
frequency of the two nuclei of the molecule about their equilibrium position. If we 
denote by x the displacement r — r, of the two nuclei from their equilibrium 
position r,, the electric dipole moment of the molecule can be written: 


D(x) = de + dix +... (30) 


The vibrational frequencies of this dipole are therefore the Bohr frequencies which 
can appear in the expression for < X >(£). For a harmonic oscillator, the selection 
rules satisfied by X are such that only one Bohr frequency can be involved, the 
frequency w/2z (cf. complement Ay). 

When we take the perturbation W into account, the states | p, > of the oscillator 
are “mixed” [cf. expression (29)], and X can connect states | yw, > and | , > for 
which n' — n A + 1: new frequencies can thus be absorbed or emitted by the 
molecule. 

To analyze this phenomenon more closely, we shall assume that the molecule 
is initially in its vibrational ground state | Yo > (this is practically always the case 
at ordinary temperatures T since, in general, hw > kT). By using expression (29), 
we can calculate, to first order* in a, the matrix elements of X between the state | Yo > 
and an arbitrary state | y, >. A simple calculation thus yields the following matrix 
elements (all the others are zero to first order in 0): 


i 1 

o a (31-a) 
s 1 

lxi =e (31-b) 

<vol¥|vo> = -20 831-0) 


From this, we can find the transition frequencies observable in the absorption 
spectrum of the ground state. We naturally find the frequency : 


"e Lo (32-a) 
which appears with the greatest intensity since, according to (31-a), <y, | x Wo > 
is of zeroeth-order in c. Then, with a much smaller intensity [cf. formula (31-b)], 
we find the frequency : 

= La La (32-b) 


* lt would not be correct to keep terms of order higher than 1 in the calculation, since expansion (29) 
is valid only to first order in ø. 
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which is often called the second harmonic (although it is not rigorously equal to 
twice v,). 


COMMENT : 


Result (31-c) means that the mean value of Y is not zero in the ground state. This 
can easily be understood from figure i, since the oscillatory motion is no longer symmet- 
rical about O. If a is negative (the case in figure 1). the oscillator spends more time in 
the x > 0 region than in the x < 0 region, and the mean value of X must be positive. 
We thus understand the sign appearing in (31-c). 


The preceding calculation thus reveals only one new line in the absorption 
spectrum. Actually, the perturbation calculation could be pursued to higher 
orders in o, taking into account higher order terms in expansion (30) of the dipole 
moment D(x), as well as terms in x4, x5, ... in the expansion of the potential in the 
neighborhood of x = 0. All the frequencies : 


T es (33) 


with n = 3, 4, 5, ..., would then be found to be present in the absorption spectrum 
of the molecule (with intensities decreasing very rapidly with n). This would 
finally give, for this spectrum, the form shown in figure 3. This is what is actually 


FIGURE 3 


Form of the vibrational spectrum of a heteropolar diatomic molecule. Because of the anharmonicity 
of the potential and higher order terms in the power series expansion in x (the distance between the 
two atoms) of the molecular dipole moment D(x), a series of “ harmonic” frequencies v,. r}. .....¥,. ... 
appear in addition to the fundamental frequency r,. Note that the corresponding lines are not quite 
equidistant and that their intensity decreases rapidly with 7. 


1118 


www.elsolucionario.net 
HARMONIC OSCILLATOR: PERTURBATION IN x, x} x? 


observed. Note that the various spectral lines of figure 3 are not equidistant since, 
according to formula (28): 


v, — 0 E (10) (34) 

Vo y A — 1507?) (35) 

== (1- Lo) (36) 
which gives the relation: 

(v, — v) — yi = (v; — v) — v, — v,) = 0 (37) 


Thus we see that the study of the precise positions of the lines of the absorption 
spectrum makes it possible to find the parameter o. 


COMMENTS: 


(i) The constant č appearing in (52) of complement Fy, can be evaluated by using formula (27) 
of this complement. Comparing these two expressions and replacing n by v in (27), we 
obtain : 

15 
See 38 
¿= -20 (38) 
Now, the perturbing potential in Fy,, is equal to — gx?, while here we have chosen it 
equal to chw3, that is, equal to : 


1 
3,515 
o" ) x? (39) 


We therefore have: 


paz ol : j (40) 


which, substituted into (38). finally yields : 


¿8 (41) 


(ii) In the expansion of the potential in the neighborhood of x = 0, the term in x* is much 
smaller than the term in x° but it corrects the energies to first order, while the term in x? 
enters only in second order (cf. $3-b-P above). It is therefore necessary to evaluate these 
two corrections simultaneously (they may be comparable) when the spectrum of figure 3 
is studied more precisely. 


References and suggestions for further reading: 


Anharmonicity of the vibrations of a diatomic molecule: Herzberg (12.4), vol. I, 
chap. IIT, $2. 
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INTERACTION BETWEEN THE MAGNETIC DIPOLES 
OF TWO SPIN 1/2 PARTICLES 


1. The interaction Hamiltonian W 


a. The form of the Hamiltonian W. Physical interpretation 
b. An equivalent expression for W 
c. Selection rules 


2. Effects of the dipole-dipole interaction on the Zeeman sublevels of two fixed particles 


a. Case where the two particles have different magnetic moments 
b. Case where the two particles have equal magnetic moments 
c. Example: the magnetic resonance spectrum of Gypsum 


3. Effects of the interaction in a bound state 


In this complement, we intend to use stationary perturbation theory to study the 
energy levels of a system of two spin 1/2 particles placed in a static field B, and 
coupled by a magnetic dipole-dipole interaction. 

Such systems do exist. For example, in a gypsum monocrystal (CaSO,, 2H,O), 
the two protons of each crystallization water molecule occupy fixed positions, and 
the dipole-dipole interaction between them leads to a fine structure in the nuclear 
magnetic resonance spectrum. 

In the hydrogen atom, there also exists a dipole-dipole interaction between the 
electron spin and the proton spin. In this case, however, the two particles are 
moving relative to each other, and we shall see that the effect of the dipole-dipole 
interaction vanishes due to the symmetry of the Is ground state. The hyperfine 
structure observed in this state is thus due to other interactions (contact interaction ; 
cf. chap. XII, §§ B-2 and D-2 and complement Axi). 


1. The interaction Hamiltonian W 


a. THE FORM OF THE HAMILTONIAN W. PHYSICAL INTERPRETATION 


Let S, and S, be the spins of particles (1) and (2), and M, and M,, their 
corresponding magnetic moments : 


M, = 715, 

M, — 7,8, (1) 
[where y, and y, are the gyromagnetic ratios of (1) and (2)]. 

We call W the interaction of the magnetic moment M, with the field created 


by M, at (2). If n denotes the unit vector of the line joining the two particles and r, 
the distance between them (fig. 1), W can be written: 


LES, S, — 3/8, -m)(S, -n)] 2) 


Ho 
W==_ — 
dz Vive 3 
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The calculation which enables us to obtain expression (2) is in every way analogous 
to the one which will be presented in complement C,, and which leads to the expres- 
sion for the interaction between two electric dipoles (cf. p. 1132). 


Va FIGURE | 
M, 


Relative disposition of the magnetic moments M, 
and M, of particles (1) and (2) (r is the distance 
between the two particles, and n is the unit vector 
of the straight line between them). 


b. AN EQUIVALENT EXPRESSION FOR W 
Let 0 and ọ be the polar angles of n. If we set: 


Mo 7172 
(r) = — ae (3) 
we gel: 
W = &r){ 3[S,, cos 9 + sin 0 (S,, cos p + S,, sin @)] 
x [S,,cos@ + sin 0 (S, cos p + S, sin p)] — S, .S,) 


= ¿(r) {3[si.cose +5 sin 0(S,,e°% +S, | 


x E cos 0 T an USE +S, e*)| —§,.S, } (4) 


2 
that is: 
W= ANT ET +R AT, ET + Ta si 
where: 
Ty = (3 cos? 0 — 1)S,,S,, 
I = -306 cos? 9 — 1)(S,,S,- + S,_S2,) 
T, = 5 sin 0 cos 0 e~" (5,83, + SiS) 
(6) 
T, => sin O cos 0 e'*(S,,S,_ + S,_S,) 
3 2.2 — 2ip S 
T, = Ve Si+ 2+ 
3 - 2 pat 
T_, =7 sin? 9%? S, _S,- 
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Each of the terms 7, (or T,) appearing in (5) is, according to (6), the product 
of a function of O and y proportional to the second-order spherical harmonic Y? 
and an operator acting only on the spin degrees of freedom [the space and spin 
operators appearing in (6) are second-rank tensors; W, for this reason, is often 
called the “tensor interaction” ]. 


c. SELECTION RULES 


r, O and @ are the spherical coordinates of the relative particle associated 
with the system of two particles (1) and (2) The operator W acts only on these 
variables and on the spin degrees of freedom of the two particles. Let { | Onim)! 
be a standard basis in the state space 6, of the relative particle, and { | e,, €, > }, the 
basis of eigenvectors common to S, and S,- in the spin state space (e, = +,¢, = +). 
The state space in which W acts is spanned by the { | n.m > O | €,, €, > } basis, 
in which it is very easy, using expressions (5) and (6), to find the selection rules 
satisfied by the matrix elements of W. 


a. Spin degrees of freedom 
— T changes neither e, nor e). 
— T, “flips” both spins: 


+. => —=>|-=,+> and |-,+ > —=|+,-> 

— T, flips one of the two spins up: 

| =.> —> | +,& > or le, - > — |e, +> 

— Similarly, 7. , flips one of the two spins down: 

|+,6.> —|-,8,> or |a., +> — |en- > 

— Finally, T, and 7_, flip both spins up and down, respectively : 
|-,-—> —|+.+> and |+, +> —|-—. -> 


P. Orbital degrees of freedom 


When we calculate the matrix element of €(r)7, between the state | (,, , , > 
and the state | Ppr >, the following angular integral appears : 


| Y= (O, 9) ¥4(0, 4) YO, y) dQ (7) 


which, according to the results of complement Cy, is different from zero only for: 


Pehi-2,f+2 (8-a) 
m=m+q (8-b) 
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Note that the case / = /' = 0, although not in contradiction with (8), is excluded 
because we must always be able to form a triangle with /, /' and 2, which is impossible 
when / = l = 0. We must have then: 


Ll > (8-c) 


2. Effects of the dipole-dipole interaction on the Zeeman sublevels 
of two fixed particles 


In this section, we shall assume the two particles to be fixed in space. We 
shall therefore quantize only the spin degrees of freedom, considering the 
quantities r, 0 and y as given parameters, 

The two particles are placed in a static field B, parallel to Oz. The Zeeman 
Hamiltonian H,, describing the interaction of the two spin magnetic moments 
with B,, can then be written: 


Ho = wS: + wS; (9) 
with: 

w; = — Bo 

w, = — ¥2Bo (10) 


In the presence of the dipole-dipole interaction W, the total Hamiltonian H of the 
system becomes : 


H=H,+W (11) 


We shall assume the field B, to be large enough for us to be able to treat W like 
a perturbation. 


a. CASE WHERE THE TWO PARTICLES HAVE DIFFERENT MAGNETIC MOMENTS 
o. Zeeman levels and the magnetic resonance spectrum in the absence of inter- 
action 


According to (9). we have: 
h 
Hole, 82 > = 5 (6,0, + £02) | £1, £2 > (12) 


Figure 2-a represents the energy levels of the two-spin system in the absence of the 
dipole-dipole interaction (we have assumed œw; > œ, > 0). Since œ, # wz, these 
levels are all non-degenerate. 

If we apply a radio-frequency field B, cos wr parallel to Ox, we obtain a 
series of magnetic resonance lines. The frequencies of these resonances cor- 
respond to the various Bohr frequencies which can appear in the evolution of 
(Si, + Y2S,, > (the radio-frequency field interacts with the component along Ox 
of the total magnetic moment). The solid-line (dashed-line) arrows of figure 2-a 
join levels between which S,,(S,,) has a non-zero matrix element. Thus we see that 
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FIGURE 2 


Energy levels of two spin 1/2 particles, placed in a static field B, parallel to Oz. The two Larmor 
angular frequencies, w, = — 7,8) and w, = — y,B,,are assumed to be different. 

For figure a, the energy levels are calculated without taking account of the dipole-dipole inter- 
action W between the two spins. 

For figure b, we take this interaction into account. The levels undergo a shift whose approximate 
value, to first order in W, is indicated on the right-hand side of the figure. The solid-line arrows 
join the levels between which S,, has a non-zero matrix element, and the dashed-line arrows, those 
for which S,, does, 


+1 : 

Wy 4 01 
40 40 
> > 
Wz b w, 


FIGURE 3 


The Bohr frequencies which appear in the evolution of < S,, > and < S,, > give the positions of the 
magnetic resonance lines which can be observed for the two-spin system (the transitions corres- 
ponding to the arrows of figure 2). 

In the absence of a dipole-dipole interaction, two resonances are obtained, each one corresponding 
to one of the two spins (fig. a). The dipole-dipole interaction is expressed by a splitting of each of 
the two preceding lines (fig. b). 
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there are two distinct Bohr angular frequencies, equal to w, and w, (fig. 3-a), which 
correspond simply to the resonances of the individual spins, (1) and (2). 


B. Modifications created by the interaction 


Since all the levels of figure 2-a are non-degenerate, the effect of W can be 
obtained to first order by calculating the diagonal elements of W,<e,,e, | W]|e,, e, >. 
It is clear from expressions (5) and (6) that only the term 7, makes a non-zero contri- 
bution to this matrix element, which is then equal to: 


(ep e] W | £1, e, > = Er) (3 cos? 0 — ps = £,£, hQ (13) 
with: 

Q = 400 cos? 0 — 1) = Pod (3 cos? 0 — 1) (14) 
Since W is much smaller than A,, we have: 

Q%<0,— 0, (15) 


From this we can immediately deduce the level shifts to first order in W: AQ for 
|+, + >and | —, — >, and — AQ for | +, — > and for | —, + > (fig. 2-b). 

What now happens to the magnetic resonance spectrum of figure 3-a? If we 
are concerned only with lines whose intensities are of zeroeth order in W (that is, 
those which approach the lines of figure 2-a when W approaches zero), then to 
calculate the Bohr frequencies which appear in < S,, > and < S,, > we simply use the 
zeroeth-order expressions for the eigenvectors*. It is then the same transitions 
which are involved (compare the arrows of figures 2-a and 2-b). We see, however, 
that the two lines which correspond to the frequency w, in the absence of coupling 
(solid-line arrows) now have different frequencies: w, + 22 and w, — 28. 
Similarly, the two lines corresponding to œ, (dashed-line arrows) now have 
frequencies of w, + 22andw, — 22. The magnetic resonance spectrum is therefore 
now composed of two “doublets” centered at w, and w,, the interval between 
the two components of each doublet being equal to 4Q (fig. 3-b). 

Thus, the dipole-dipole interaction leads to a fine structure in the magnetic 
resonance spectrum, for which we can give a simple physical interpretation. The 
magnetic moment M, associated with S, creates a “local field” b at particle (2). 
Since we assume B, to be very large, S, precesses very rapidly about Oz, so we can 
consider only the S,, component (the local field created by the other components 
oscillates too rapidly to have a significant effect). The local field b therefore has 
a different direction depending on whether the spin is in the state | + > or | — >, 
that is, depending on whether it points up or down. It follows that the total field 


* If we used higher-order expressions for the eigenvectors, we would see other lines of lower 
intensity appear (they disappear when W ——> 0). 
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“seen” by particle (2), which is the sum of B, and b, can take on two possible 
values*. This explains the appearance of two resonance frequencies for the 
spin (2). The same argument would obviously enable us to understand the origin 
of the doublet centered at w,. 


b. CASE WHERE 
THE TWO PARTICLES HAVE EQUAL MAGNETIC MOMENTS 


%. Zeeman levels and the magnetic resonance spectrum in the absence of the 
interaction 


Formula (12) remains valid if we choose w, and œ, to be equal. We shall 
therefore set: 


0, =0,=0= — yB (16) 
1 2 0 


+, +> 
ho === 


a a: eee ee —— 0,0) 
ON fs Ny ud salia 
o. Joh Dt ao 


FIGURE 4 


The two spin 1/2 particles are assumed to have the same magnetic moment and, consequently, the 
same Larmor angular frequency « = — yBo. 


In the absence of a dipole-dipole interaction, we obtain three levels, one of which is two-fold degenerate 
(fig. a). Under the effect of the dipole-dipole interaction (fig. b), these levels undergo shifts whose 
approximate values (to first order in W) are indicated on the right-hand side of the figure. To 
zeroeth-order in W, the stationary states are the eigenstates | S, M > of the total spin. The arrows 
join the levels between which S,, + S,, has a non-zero matrix element. 


* Actually, since [B¿| > |b|, it is only the component of b along B, which is involved. 
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The energy levels are shown in figure (4-a). The upper level, | +, + >, and the 
lower level, | —, — >, of energies fi» and — hc), are non-degenerate. On the other 
hand, the intermediate level, of energy 0. is two-fold degenerate: to it correspond 
the two eigenstates | +, — > and | —, + >. 

The frequencies of the magnetic resonance lines can be obtained by finding 
the Bohr frequencies involved in the evolution of < S,, + S,, > (the total magnetic 
moment is now proportional to the total spin S = S, + S,). We easily obtain the 
four transitions represented by the arrows in figure 4-a, which correspond to 
a single angular frequency w. This finally yields the spectrum of figure 5-a. 


62 
<--> 
—+ a A = b 
@ (49) 


FIGURE 5 


Shape of the magnetic resonance spectrum which can be observed for a system of two spin 1/2 particles, 
with the same gyromagnetic ratio, placed in a static field By. 

In the absence of a dipole-dipole interaction, we observe a single resonance (fig. a). In the presence 
of a dipole-dipole interaction (fig. b), the preceding line splits. The separation 62 between the two 
components of the doublet is proportional to 3 cos? O — 1, where ( is the angle between the static- 
field B, and the straight line joining the two particles. 


B. Modifications created by the interaction 


The shifts of the non-degenerate levels | +, + > and | —. — > can be obtained 
as they were before, and are both equal to hQ | we must replace, however, y, and y, 
by y in expression (14) for 2]. 

Since the intermediate level is two-fold degenerate, the effect of W on this 
level can now be obtained by diagonalizing the matrix which represents the res- 
triction of W to the subspace {| +, — »,|—, + >}. The calculation of the 
diagonal elements is performed as above and vields: 


(+,-|W|4+.->=¢-.4+|Wl-.4+> =- (17) 


As for the non-diagonal element < +. —|W|-—, +, we easily see from 
expressions (5) and (6) that only the term Tọ contributes to it: 


(+,=|W|-,+>= - De cos? 0 — 1) 
x < +, — | (S1 +52- + S,_S,4)}-, +) 


2 
= — mia cos? ) — 1) = — AQ (18) 
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We are then led to the diagonalization of the matrix: 


_ no | r) (19) 
1 1 


whose eigenvalues are — 240 and 0, associated respectively with the eigenvectors 


Iy > =H. => +] =, +>) and [y> =a (J $= = 1. +>) 
V2 J2 
Figure 4-b represents the energy levels of the system of two coupled spins. 

The energies, to first order in W, are given by the eigenstates to zeroeth order. 
Note that these eigenstates are none other than the eigenstates | S, M > 
common to S? and S,, where S = S, + S, is the total spin. Since the operator S, 
commutes with S?, it can couple only the triplet states, that is, | 1,0 > to | 1, 1 > and 
|1,0> to | 1, — 1 >. This gives the two transitions represented by the arrows in 
figure 4-b, and to which correspond the Bohr frequencies œ + 32 and w — 30, The 
magnetic resonance spectrum is therefore composed of a doublet centered at œw, the 
separation between the two components of the doublet being equal to 62 (fig. 5-b). 


c. EXAMPLE : THE MAGNETIC RESONANCE SPECTRUM OF GYPSUM 


The case studied in § b above corresponds to that of two protons of 
a crystallization water molecule in a gypsum monocrystal (CaSO,, 2H,0). These 
two protons have identical magnetic moments and can be considered to occupy 
fixed positions in the crystal. Moreover, they are much closer to each other than 
to other protons (belonging to other water molecules). Since the dipole-dipole 
interaction decreases very quickly with distance (1/r? law), we can neglect inter- 
actions between protons belonging to other water molecules. 

The magnetic resonance spectrum is indeed observed to contain a doublet * 
whose separation depends on the angle O between the field B, and the straight line 
joining the two protons. If we rotate the crystal with respect to the field B,, this 
angle @ varies, and the separation between the two components of the doublet 
changes. Thus, by studying the variations of this separation, we can determine the 
positions of the water molecules relative to the crystal axes. 

When the sample under study is not a monocrystal, but rather a powder 
composed of small, randomly oriented monocrystals, 0 takes on all possible values. 
We then observe a wide band, due to the superposition of doublets having different 
separations. 


3. Effects of the interaction in a bound state 


We shall now assume that the two particles, (1) and (2), are not fixed, but can 
move with respect to each other. 

Consider, for example, the case of the hydrogen atom (a proton and an 
electron). When we take only the electrostatic forces into account, the ground state 


* Actually, ina Gypsum monocrystal, there are two different orientations for the water molecules, 
and. consequently. two doublets corresponding to the two possible values of 0. 
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of this atom (in the center of mass frame) is described by the ket | #4 o,o >. labeled 
by the quantum numbers n = 1, / = 0, m = 0 (cf. chap. VII). The proton and the 
electron are spin 1/2 particles. The ground state is therefore four-fold degenerate, 
and a possible basis in the corresponding subspace is made up of the four vectors: 


Li DD le (20) 


where e, and ¢,, equal to + or —, represent respectively the eigenvalues of S, and J, 
(S and I: the electron and proton spins). 

What is the effect on this ground state of the dipole-dipole interaction between S 
and 1? The matrix elements of W are much smaller than the energy difference 
between the ls level and the excited levels, so that it is possible to treat the effect of W 
by perturbation theory. To first order, it can be evaluated by diagonalizing the 
4 x 4 matrix of elements < P; 0.08182 | W | (M1 0 08,8, >» The calculation of these 
matrix elements, according to (5) and (6), involves angular integrals of the form: 


| Y9%(0, 9) ¥4(0, 9) ¥2(0, p) d2 (21) 


which are equal to zero, according to the selection rules established in $1-c above 
[in this particular case, it can be shown very simply that integral (21) is equal to 
zero: since Y9 is a constant, expression (21) is proportional to the scalar product 
of Y% and Y9, which is equal to zero because of the spherical harmonic orthogona- 
lity relations ]. 

The dipole-dipole interaction does not modify the energy of the ground state 
to first order. It enters, however, into the (hyperfine) structure of the excited levels 
with / > 1. We must then calculate the matrix elements < Ppi 6185 | W | 0, 1 ¡6182 », 
that is, the integrals: 


| Y7*(0, o) Y 40, o) Y7(0, p) dQ 


which, according to (8-c), become non-zero as soon as / > 1. 


References and suggestions for further reading: 
Evidence in nuclear magnetic resonance experiments of the magnetic dipole 


interactions between two spins in a rigid lattice: Abragam (14.1), chap. IV, §II and 
chap. VII, §IA; Slichter (14.2), chap. 3; Pake (14.6). 
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Complement Cx, 


VAN DER WAALS FORCES 


1. The electrostatic interaction Hamiltonian for two hydrogen atoms 


a. Notation 
b. Calculation of the electrostatic interaction energy 
2. Van der Waals forces between two hydrogen atoms in the Is ground state 
a. Existence of a — C/R® attractive potential 
b. Approximate calculation of the constant C 
c. Discussion 
3. Van der Waals forces between a hydrogen atom in the Is state and a hydrogen atom in the 2p state 
a. Energies of the stationary states of the two-atom system, Resonance effect 
b. Transfer of the excitation from one atom to the other 


4. Interaction of a hydrogen atom in the ground state with a conducting wall 


The character of the forces exerted between two neutral atoms changes with the 
order of magnitude of the distance R separating these two atoms. 

Consider, for example, two hydrogen atoms. When R is of the order of atomic 
dimensions (that is, of the order of the Bohr radius ag), the electronic wave functions 
overlap, and the two atoms attract each other, since they tend to form an H, molecule. 
The potential energy of the system has a minimum* for a certain value R, of the 
distance R between the atoms. The physical origin of this attraction (and therefore of 
the chemical bond) lies in the fact that the electrons can oscillate between the two 
atoms (cf. §§C-2-c and C-3-d of chapter IV). The stationary wave functions of the 
two electrons are no longer localized about only one of the nuclei; this lowers the 
energy of the ground state (cf. complement G,,). 

At greater distances, the phenomena change completely. The electrons can 
no longer move from one atom to the other, since the probability amplitude of such 
a process decreases with the decreasing overlap of the wave functions, that is, 
exponentially with the distance. The preponderant effect is then the electrostatic 
interaction between the electric dipole moments of the two neutral atoms. This 
gives rise to a total energy which is attractive and which decreases, not exponentially, 
but with 1/R*. This is the origin of the Van der Waals forces, which we intend to 
study in this complement by using stationary perturbation theory (confining 
ourselves, for the sake of simplicity, to the case of two hydrogen atoms). 

It should be clearly understood that the fundamental nature of Van der Waals 
forces is the same as that of the forces responsible for the chemical bond : the basic 
Hamiltonian is electrostatic in both cases. Only the variation of the energies of the 
quantum stationary states of the two-atom system with respec: to R allows us to 
define and differentiate these two types of forces. 


* At very short distances, the repulsive forces between the nuclei always dominate. 
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Van der Waals forces play an important role in physical chemistry, especially 
when the two atoms under consideration have no valence electrons (forces between 
rare gas atoms, stable molecules, etc.). They are partially responsible for the 
differences between the behavior of a real gas and that of an ideal gas. Finally. 
as we have already said. these are long-range forces, and are therefore involved 
in the stability of colloids. 

We shall begin by determining the expression for the dipole-dipole interaction 
Hamiltonian between two neutral hydrogen atoms ($1). This will then enable us 
to study the Van der Waals forces between two atoms in the Is state ($ 2). or between 
an atom in the 2p state and an atom in the Is state ($3). Finally, we shall show ($4) 
that a hydrogen atom in the Is state is attracted by its electrical mirror image in 
a perfectly conducting wall. 


1. The electrostatic interaction Hamiltonian 


for two hydrogen atoms 


a. NOTATION 


The two protons of the two hydrogen atoms are assumed to remain motionless 
at points 4 and B (fig. 1). We shall set: 


a t FIGURE 1 
n B 


Relative position of the two hydrogen 

R atoms. R is the distance between the two 

A protons, which are situated at 4 and B, 

and n is the unit vector on the line joining 

them. r, and r, are the position vectors of 

/ the two electrons with respect to points 4 
Fa and B respectively. 


R = OB - OA (1) 

R = |R] (2) 
R 

n= (3) 


R is the distance between the two atoms, and n is the unit vector on the line which 
joins them. Let r, be the position vector of the electron attached to atom (A) with 
respect to point 4. and rg. the position vector of the electron attached to atom B 
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with respect to B. We call: 


J, =qr; (4) 
Pg = qf; (5) 


the electric dipole moments of the two atoms (q is the electron charge). 
We shall assume throughout this complement that: 


R > ral, [rg] (6) 


Although they are identical, the electrons of the two atoms are well separated, and 
their wave functions do not overlap. It is therefore not necessary to apply the sym- 
metrization postulate (cf. chap. XIV, §D-2-b). 


b. CALCULATION OF THE ELECTROSTATIC INTERACTION ENERGY 


Atom (4) creates at (B) an electrostatic potential U with which the charges 
of (B) interact. This gives rise to an interaction energy Y, 

We saw in complement E, that U can be calculated in terms of R, n and the 
multipole moments of atom (4). Since (A) is neutral, the most important contri- 
bution to U is that of the electric dipole moment 2,. Similarly, since (B) is neutral, 
the most important term in # comes from the interaction between the dipole 
moment 2, of (B) and the electric field E = — VU which is essentially created by 2,4. 
This explains the name of “dipole-dipole interaction” given to the dominant term 
of 4. There exist, of course, smaller terms (dipole-quadrupole, quadrupole- 
quadrupole, etc.), and W' is written: 


W = Wig + Wag + Way + Wry t (7) 


To calculate Y,,, we shall start with the expression for the electrostatic 
potential created by 2, at (B): 


1 9, .R 
U(R) = mA i 
(R) Arty R? i 
from which we see: 
q 1 
E = — V,U = = — [r4 — 3(r,-n)n 9 
3 ERGO 2) 
and, consequently : 
2 
e 
Waa = “EH Mast + SNE A] (10) 


We have set e? = q?/4ne,, and we have used expressions (4) and (5) for 2, and Dz. 
In this complement, we shall choose the Oz axis parallel to n, so that (10) can be 
written : 
e? 
Waa = R? (x4X_ + YaYa — 22428) (11) 
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In quantum mechanics, #,, becomes the operator W,,, which can be obtained 
by replacing in (11) x4, Y y» .... Zg by the corresponding observables X,. Y}, .... Zp, 
which act in the state spaces 4, and 4, of the two hydrogen atoms* : 


a? 
Wa = moe + Y, — 22,2,) (12) 


2. Van der Waals forces between two hydrogen atoms 
in the 1s ground state 


a. EXISTENCE OF A - C,R° ATTRACTIVE POTENTIAL 
a. Principle of the calculation 


The Hamiltonian of the system is: 
H = Ho, + Hog + Wag (13) 


where H,, and Hog are the energies of atoms (4) and (B) when they are isolated. 
In the absence of W,,, the eigenstates of H are given by the equation: 


(Ho, + Hog) | De tae’ Qe tm’ >= (E, + E,,) | Ona OF ewe > (14) 


where the | nim > and the E, were calculated in $C of chapter VII. In particular, 
the ground state of Hy, + Hog is | 4.0.0: Oo.0 >. of energy — 2E,. It is non- 
degenerate (we do not take spins into account). 

The problem is to evaluate the shift in this ground state due to W,, and, in 
particular, its R-dependence. This shift represents, so to speak, the interaction 
potential energy of the two atoms tn the ground state. 

Since W,, is much smaller than //), and Hog, we can calculate this effect by 
stationary perturbation theory. 


B. First-order effect of the dipole-dipole interaction 
Let us show that the first-order correction : 


Ey =< PF ow: P7 o.o | Wis | 970.03 P7 oo > (15) 


is zero. e, involves, according to expression (12) for W,,, products of the form 
< vi o.o | Xa | P$.o.0 > < P?o.0 | Xe | 7.o. > (and analogous quantities in which 
X, is replaced by Y,, Z, and X, by Yp. Zg), which are zero, since, in a stationary state 
of the atom, the mean values of the components of the position operator are zero. 


* The translational external degrees of freedom of the two atoms are not quantized : for the 
sake of simplicity, we assume the two protons to be infinitely heavy and motionless. In (12), R is therefore 
a parameter and not an observable. 
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COMMENT: 


The other terms, W. Wa. W,.... of expansion (7) involve products of 
two multipole moments, one relative to (A) and the other one to (B), at least 
one of which is of order higher than 1. Their contributions are also zero 
to first order: they are expressed in terms of mean values in the ground state 
of multipole operators of order greater than or equal to one, and we know 
(cf. complement Ex, $2-c) that such mean values are zero in an / = 0 state 
(triangle rule of Clebsch-Gordan-coefficients). Therefore we must find the 
second-order effect of W. which then constitutes the most important energy 


correction. 


y. Second-order effect of the dipole-dipole interaction 


According to the results of chapter XI, the second-order energy correction 
can be written: 


oL Wo K Phim: Oimn | Waua | Too: P$ oo yl? 
m= 2 e E, (16) 
ntm 


where the notation 2” means that the state | pto .: P? o.o > is excluded from the 
summation *. 

Since W,, is proportional to 1/R*, e, is proportional to 1/ Rê. Furthermore, 
all the energy denominators are negative, since we are starting from the ground state. 
Therefore, the dipole-dipole interaction gives rise to a negative energy proportional 
to 1/R®: 

£ = — = (17) 
Van der Waals forces are therefore attractive and vary with 1/R”. 

Finally, let us calculate the expansion of the ground state to first order in W,,. 
We find, according to formula (B-11) of chapter XI: 


| Vo? = | 1.0.05 PŽ o.o > 


< Phim: Orm | Wat Ol oe. ree. 
Be r oå oi o8 ne 5 ndm ni .m dd 1,0,0 ° 1.0,0 aor (18 
| ahi d . a. 2E; da E, a Ex ) 


nim 
num 
COMMENT: 


The matrix elements appearing in expressions (16) and (18) involve the quantities 
€ Gaia | Xa oo >< Oe a | Xy | (1.0.0 > (and analogous quantities in which X, and 


* This summation is performed not only over the bound states. but also over the continuous 
spectrum of Hy, + Hoz 
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Xp are replaced by Y, and Y; or Z} and Z,), which are different from zero only if / = | 
and /' = 1. These quantities are indeed proportional to products of angular integrals 


| | Y7*(Q,) Y1(Q,) Yo(Q,) d0,] x | YT *(25) YT(Qs) Yo(Qs) do, | 


which, according to the results of complement C,. are zero if / 4 Lor F 4 1. We can 
therefore. in (16) and (18), replace / and /' by 1. 


b. APPROXIMATE CALCULATION OF THE CONSTANT C 
According to (16) and (12). the constant C appearing in (17) is given by: 


Cae y K Ds On tam | (X,¿Xg + Y¿Ya — 2Z 425) | Pioo ; P? oo >F 
nim 2E; + E, + Es 
ntm (19) 


We must have n > 2 and n' > 2. For bound states. |E,| = E,:m? is smaller 
than £,. and the error is not significant if we replace in (19) E, and E, by 0. 
For states in the continuous spectrum, E, varies between ( and + x. The matrix 
elements of the numerator become small, however, as soon as the size of E, becomes 
appreciable, since the spatial oscillations of the wave function are then numerous 
in the region in which ¢, o y (r) is non-zero. 

To have an idea of the order of magnitude of C, we can therefore replace all 
the energy denominators of (19) by 2£,. Using the closure relation and the fact 
that the diagonal element of W, is zero (§2-a-B). we then get : 


de 
4 
e m. E 
= QE, 4 Gia: P? 0.0 | (X4Xp + Y ¿Ya — 22425) | 3.0.0: 1.0.0 > (20) 
I 


This expression is simple to calculate: because of the spherical symmetry of 
the Is state, the mean values of the cross terms of the type X4 Y4. Xp Yp .... are zero. 
Furthermore, and for the same reason. the various quantities: 


< 07.0.0 | X4 letoo). (ota! Y? [Or eae = Piso l Zalot oo) 


are all equal to one third of the mean value of R} = X} + Y} + Z2. We finally 
obtain, therefore, using the expression for the wave function 9, y p(r): 


= 6 4 (21) 


C= Fe 0 Coto $ 


(where «1, is the Bohr radius) and, consequently: 


5 e? 5 
fy = — te e) (22) 
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The preceding calculation is valid only if a, < R (no overlapping of the wave 
functions). Thus we see that e, is of the order of the electrostatic interaction between 
two charges q and — q, multiplied by the reduction factor (ay/R)? < 1. 


c. DISCUSSION 


a. “Dynamical” interpretation of Van der Waals forces 


At any given instant, the electric dipole moment (we shall say, more simply, 
the dipole) of each atom has a mean value of zero in the ground state | o% o.o > 
or | o? o.o >. This does not mean that any individual measurement of a component 
of this dipole will yield zero. If we make such a measurement, we generally find 
a non-zero value; however, we have the same probability of finding the opposite 
value. The dipole of a hydrogen atom in the ground state should therefore be 
thought of as constantly undergoing random fluctuations. 

We shall begin by neglecting the influence of one dipole on the motion of the 
other one. Since the two dipoles are then fluctuating randomly and independently, 
their mean interaction is zero: this explains the fact that Wia has no first-order 
effect. 

However, the two dipoles are not really independent. Consider the electrostatic 
field created by dipole (4) at (8). This field follows the fluctuations of dipole (4). 
The dipole it induces at (B) is therefore correlated with dipole (A), so the electrostatic 
field which “returns” to (A) is no longer uncorrelated with the motion of dipole (4). 
Thus, although the motion of dipole (A) is random, its interaction with its own field, 
which is “reflected” to it by (B),does not have a mean value of zero. This is the 
physical interpretation of the second-order effect of Wa. 

The dynamical aspect is therefore useful for understanding the origin of 
Van der Waals forces. If we were to think of the two hydrogen atoms in the ground 
state as two spherical and “static” clouds of negative electricity (with a positive 
point charge at the center of each one), we would be led to a rigorously zero inter- 
action energy. 


B. Correlations between the two dipole moments 


Let us show more precisely that there exists a correlation between the two 
dipoles. 

When we take W,, into account, the ground state of the system is no longer 
| of .0.0: 00.0 >» but | Yo > [ef. expression (18)]. A simple calculation then yields : 


{Wo | Xa [Wo > = - = <Wo| Z| o> = 0 (23) 


to first order in Wa- 


Consider, for example, < Yo | X4 | Wo >. The zeroeth-order term, 
< 140,03 4.0.0 | Xa | Pioo; 1.0.0 > 


is zero, since it is equal to the mean value of X, in the ground state | pf o.o >. To first order, the 
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summation appearing in formula (18) must be included. Since W contains only products of the 
form XX y, the coefficients of the kets | Pf a 0: OP ¡»and | 9%, OF oa > in this summation 
are zero. The first-order terms which could be different from zero are therefore proportional to 


< Phim? Ont am | Xa | 10,05 Poo > with /#0 and 0; 


These terms are all zero since X4 does not act on |pf y. > and < PBa m | Qe oo > = 0 for 
#0. 


Thus, even in the presence of an interaction, the mean values of the components 
of each dipole are zero. This is not surprising : in the interpretation of $ 2-c-a, the 
dipole induced in (B) by the field of dipole (4) fluctuates randomly, like this field, 
and has, consequently, a mean value of zero. 

Let us show, on the other hand, that the two dipoles are correlated, by eva- 
luating the mean value of a product of two components, one relative to dipole (A) and 
the other, to dipole (B). We shall calculate < Wo | (X,Xg + Y, Yp — 2Z,Zg)| Wo >. 

3 


for example, which, according to (12), is nothing more than ce < Wo | Wal Wo >. 
e? 


Using (18), we immediately find, taking (15) and (16) into account, that : 
R3 
< Wo | (X, Xp + Ma Ya — 22425) | Wo > = w177 #0 (24) 


Thus, the mean values of the products X ,X,, Yı Yp and Z,Z, are not zero, as would 
be the products of mean values < X> < Xp» LL) lh <2Z4> lZ? 
according to (23). This proves the existence of a correlation between the two dipoles. 


Y. Long-range modification of Van der Waals forces 


The description of $2-c-x above enables us to understand that the preceding 
calculations are no longer valid if the two atoms are too far apart. The field 
produced by (A) and “reflected” by (B) returns to (A) with a time lag due to the 
propagation (4) —> (B) —> (A), and we have argued as if the interactions were 
instanlaneous. 

We can see that this propagation time can no longer be neglected when it 
becomes of the order of the characteristic times of the atom’s evolution, that is. of the 
order of 22/w,,, Where w,, = (E, — E,)/h denotes a Bohr angular frequency. In other 
words, the calculations performed in this complement assume that the distance R 
bet ween the two atoms is much smaller than the wavelengths 2xc/w, , of the spectrum 
of these atoms (about 1 000 A), 

A calculation which takes propagation effects into account gives an interaction 
energy which, at large distances, decreases as 1;R’. The [/R® law which we have 
found therefore applies to an intermediate range of distances, neither too large 
(because of the time lag) nor too small (to avoid overlapping of the wave functions). 
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3: Van der Waals forces between a hydrogen atom 
in the 1s state and a hydrogen atom in the 2p state 


a. ENERGIES OF THE STATIONARY STATES OF THE TWO-ATOM SYSTEM. 
RESONANCE EFFECT 


The first excited level of the unperturbed Hamiltonian H,, + Ho, ts eight- 

tole degenerate. The associated EP RS uo Spake y spanned by the eight states 

EE Pia yA Lo? 0.05 PL, oo 2: oi. 0.0* pr, tm > With a = — 1,0. +1: 

0%. tom’ > Pioo >» With mó = — 1,0, + | which correspond to a situation in 

which one of the two atoms is in the g ground state, while the other one is in a state of 
the n = 2 level. 

According to perturbation theory for a degenerate state, we must diagonalize 
the 8 x 8 matrix representing the restriction of W, to the eigensubspace to obtain 
the first-order effect of W. We shall show that the only non-zero matrix elements 
of W,, are those which connect a state | o4 0.0; 92.1. > to the state | pfi mi 92.90. 
The operators X ,, Y}. Z4 appearing in the expression for W are odd and can there- 
fore couple | {o.o > only to one of the | v7, m >; an analogous argument is valid 
for Xp, Yp, Zg. Finally, the dipole-dipole interaction is invariant under a rotation 
of the two atoms about the Oz axis which joins them; therefore W, commutes 
with L,. + Lp- and then can only join two states for which the sum of the eigen- 
values of L,, and Lp, is the same. 

Therefore, the: preceding 8 x 8 matrix can be broken down into four 
2 x 2 matrices. One of them is entirely zero (the one which concerns the 2s states), 
and the other three are of the form: 


(,.° a es) 
k,/R3 0 


where we have set : 


Km 
< Pf oo: PŽ im | Waa | Pfam: P7 o.o p= 


25 (26) 


k,, is a calculable constant, of the order of e?a3, which we shall not treat here. 
We can immediately diagonalize matrix (25), obtaining the eigenvalues 
+ k„/R? and — k,,/R*, associated respectively with the eigenstates : 


l 
l roo: Om? +| Pfam: 1.0.0 >) 
and 
l A . B A B 
Y (| Pr.o.05 21m? — Oo ws PT oo >) 


This reveals the following important results : 
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— The interaction energy varies with 1/R* and not with I/R°, since W,, now 
modifies the energies to first order. The Van der Waals forces are therefore more 
important than they are between two hydrogen atoms in the Is state (resonance 
effect between two different states of the total system with the same unperturbed 
energy). 

— The sign of the interaction can be positive or negative (eigenvalues + k,,/R* 
and — k,/R*). There therefore exist states of the two-atom system for which there 
is attraction, and others for which there is repulsion. 


b. TRANSFER OF THE EXCITATION FROM ONE ATOM TO THE OTHER 


The two states | of 0.0: 03.1. > and | 04 i.m; @7.o.o > have the same unper- 
turbed energy and are coupled by a non-diagonal perturbation. According to the 
general results of $C of chapter IV (two-level system). we know that there is 
oscillation of the system from one level to the other with a frequency proportional 
to the coupling. 

Therefore, if the system starts in the state | pg 003 024) at 1 =0, it 
arrives, after a certain time (the larger R, the longer the time), in the state 
| 0% 1m3 O Lo. >» The excitation thus passes from (B) to (A), then returns to (B), 
and so on. 


COMMENT: 


If the two atoms are not fixed but, for example, undergo collision, R varies over 
time and the passage of the excitation from one atom to the other is no longer periodic. 
The corresponding collisions, called resonant collisions, play an important role in the 
broadening of spectral lines 


4. Interaction of a hydrogen atom in the ground state 
with a conducting wall 


We shall now consider a single hydrogen atom (4) situated at a distance d from 
a wall which is assumed to be perfectly conducting. The Oz axis is taken along 
the perpendicular to the wall passing through A (fig. 2). The distance d is assumed 
to be much larger than the atomic dimensions, so that the atomic structure of the 
wall can be ignored, and we can assume that the atom interacts with its electrical 
image on the other side of this wall (that is, with a symmetrical atom with opposite 
charges). The dipole interaction energy between the atom and the wall can easily 
be obtained from expression (12) for W, by making the following substitutions : 


e — -e 

R —= 2d 

Xg —> X= Xa (27) 
Ya — Y, = Y, 

Ly —> 24> > Z, 


(the change of e? to — e? is due to the sign difference of the image charges). 
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FIGURE 2 


To calculate the interaction energy of a 
hydrogen atom with a perfectly conducting 
wall, we can assume that the electric dipole 
moment qr, of the atom interacts with its 
electrical image — gr, (d is the distance 
between the proton 4 and the wall). 


Furthermore it is necessary to divide by 2 since the dipole image is fictitious and the 
electric field zero below the xOy plane. We then get: 


2 
yas Er (XZ + Y} +222) (28) 


which represents the interaction energy of the atom with the wall [W acts only 
on the degrees of freedom of (4)]. 
If the atom is in its ground state, the energy correction to first order in W is 


then: 
E, = Pioo W | Pioo > (29) 


Using the spherical symmetry of the Is state, we obtain: 


, 22 R2 pia? 
€; = — Faqs 44 1.0.01"! 91.0.0) = => a (30) 


We see that the atom is attracted by the wall: the attraction energy varies with 1af. 
and. therefore, the force of attraction varies with lid”. 

The fact that W has an effect even to first order can easily be understood 
in terms of the discussion of $ 2-c above. In this case, there is a perfect correlation 
between the two dipoles, since they are images of each other. 


References and suggestions for further reading: 


Kittel (13.2), chap. 3. p. 82:. Davydov (1.20) chap. XII. $8124 and 125: 
Langbein (12.9). 

For a discussion of retardation effects. see: Power (2.11), $87.5 and 8.4 (quantum 
electrodynamic approach): Landau and Lifshitz (7.12). chap. XI. $90 (electromagnetic 
fluctuation approach). 

See also Derjaguin’s article (12.12). 
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Complement Di; 


THE VOLUME EFFECT : THE INFLUENCE OF THE SPATIAL 
EXTENSION OF THE NUCLEUS ON THE ATOMIC LEVELS 


l. First-order energy correction 
a. Calculation of the correction 
b. Discussion 

2. Application to some hydrogen-like systems 
a. The hydrogen atom and hydrogen-like ions 
b. Muonic atoms 


The energy levels and the stationary states of the hydrogen atom were studied 
in chapter VII by assuming the proton to be a charged point particle, which 
creates an electrostatic l/r Coulomb potential. Actually, this is not quite true. 
The proton is not strictly a point charge; its charge fills a volume which has 
a certain size (of the order of I fermi = 107 1° cm). When an electron is extremely 
close to the center of the proton, it “sees” a potential which no longer varies with 1/r 
and which depends on the spatial charge distribution associated with the proton. 
This is true, furthermore, for all atoms: inside the volume of the nucleus, the elec- 
trostatic potential depends on how the charges are distributed. We thus expect the 
atomic energy levels, which are determined by the potential to which the electrons 
are subject at all points of space, to be affected by this distribution : this is what is 
called the “volume effect”. The experimental and theoretical study of such an 
effect is therefore important, since it can supply information about the internal 
structure of nuclei. 

In this complement. we shall give a simplified treatment of the volume effect 
of hydrogen-like atoms. To have an idea of the order of magnitude of the energy 
shifts it causes, we shall confine ourselves to a model in which the nucleus is 
represented by a sphere of radius pọ, in which the charge — Zq is uniformly 
distributed. In this model, the potential created by the nucleus is (c/. complement Ay, 
$ 4-b): 


Ze? 
a for r > Po 


2 2 
Ec (2) 2 3 for r < po (1) 
2Po | \Po 


(we have set e? = q?/4ne,). The shape of the variation of V(r) with respect to r 
is shown in figure 1. 

The exact solution of the Schródinger equation for an electron subject to 
such a potential poses a complicated problem. Therefore, we shall content ourselves 
with an approximate solution, based on perturbation theory. In a first approxima- 
tion, we shall consider the potential to be a Coulomb potential [which amounts to 


V(r) = 
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A V(r) 


FIGURE 1 


Variation with respect to r of the electro- 
static potential V(r) created by the charge 
distribution — Zq of the nucleus, assumed 
to be uniformly distributed inside a sphere 
of radius py. For r < po, the potential is 
parabolic. For r > po, it is a Coulomb 
potential [the extension of this Coulomb 
potential into the r < pọ zone is repre- 
sented by the dashed line; W(r) is the 
difference between V(r) and the Coulomb 
potential]. 


setting pọ = 0 in (1)]. The energy levels of the hydrogen atom are then the ones 
found in $ C of chapter VII. We shall treat the difference W(r) between the 
potential V(r) written in (1) and the Coulomb potential as a perturbation. This 
difference is zero when r is greater than the radius pọ of the nucleus. It is therefore 
reasonable that it should cause a small shift in the atomic levels (the corresponding 
wave functions extend over dimensions of the order of a, > pọ), which justifies 
a treatment by first-order perturbation theory. 


1. First-order energy correction 
a. CALCULATION OF THE CORRECTION 


By definition. W(r) is equal to: 


Zef /r\  2po | 
Wir) = < =||—])+-3]|ifO<r<p 
t) e(z r ° 
0 


ifr > py (2) 


Let | y, 1» > be the stationary states of the hydrogen-like atom in the absence of the 
perturbation W. To evaluate the effect of W to first order, we must calculate the 
matrix elements : 


< Prim | W | On w > = [an Yr*(9) Y” (Q) 
» | 12 dr R* (r) Ry rlr) Wr) 6) 
0 
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In this expression, the angular integral simply gives 0,,0,,,,. To simplify the radial 
integral, we shall make an approximation and assume* that : 


Po < ag (4) 


that is, that the r < po region, in which W(r) is not zero, is much smaller than the 
spatial extent of the functions R, (r). When r < po, we then have: 


Ra, a(r) = R,„ (0) (5) 


The radial integral can therefore be written : 


Ze? EN r\ 2p 
I = =— |R, (0)? r? dr|{—} +2 -3 6 
205 | nll J | (5) r | ( ) 


which gives : 


Ze? , ; 
and: 
Ze? , ee , 
< Po. im | wW | Ontw > _ 10 Po [R,, (0)| On Onin? (8) 


We see that the matrix representing W in the subspace &, corresponding to 
the nth level of the unperturbed Hamiltonian is diagonal. Therefore, the first-order 
energy correction associated with each state | p, ;.,, > can be written simply: 


Ze , 
AE, 1 = Hp Po RO)? (9) 


This correction does not depend on m**, Furthermore, since R, ,(0) is zero 
unless / = 0 (cf. chap. VII, $ C-4-c), only the s states (/ = 0 states) are shifted, 


by a quantity which is equal to: 


Ze? , 
AE,, 0 = o Po [R,,o(0)]? 


2nZe? , 
== Po |Pn.0,0(0)” (10) 


(we have used the fact that Y} = 1/4/47). 


* This is certainly the case for the hydrogen atom, In $2, we shall examine condition (4) in greater 
detail. 

** This result could have been expected. since the perturbation W, which is invariant under 
rotation, is a scalar (cf. complement By,, $5-b). 
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b. DISCUSSION 


AE, y can be written: 


AE, o == w2 (11) 


2 
apa (12) 


Po 


is the absolute value of the potential energy of the electron at a distance po from the 
center of the nucleus, and: 


4 : 
P = 3700 |Pn,0.0(0)|° (13) 


is the probability of finding the electron inside the nucleus. :2 and w enter into (11) 
because the effect of the perturbation W(r) is felt only inside the nucleus. 

For the method which led us to (10) and (11) to be consistent. the correc- 
tion AE, ¿ must be much smaller than the energy differences between unperturbed 
levels. Since w is very large (an electron and a proton attract each other very 
strongly when they are very close), 42 must therefore be extremely small. Before 
taking up the more precise calculation in $2, we shall evaluate the order of 
magnitude of these quantities. Let : 


he 
ao(Z) = Zme? 


(14) 


be the Bohr radius when the total charge of the nucleus is — Zq. If n is not too 
high. the wave functions €, 9 o{r) are practically localized inside a region of space 
whose volume is approximately [a,¿(Z)]?. As for the nucleus, its volume is of the 
order of pj, so: 


ip ~ Po : 
l 0 


Relation (11) then yields : 


2 3 
Ze | p 
AE, , = 2 
ds Po Ea 


_ Ze [oo | 
7 ao(Z) EA ue) 


Now, Ze?/d (Z) is of the order of magnitude of the binding energy E,(Z) of the unper- 
turbed atom. The relative value of the correction is therefore equal to : 


AE „o Ay Po j 
EZ) — Fea (17) 
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If condition (4) is met, this correction will indeed be very small. We shall now cal- 
culate it more precisely in some special cases. 


2. Application to some hydrogen-like systems 


a. THE HYDROGEN ATOM AND HYDROGEN-LIKE IONS 


For the ground state of the hydrogen atom, we have [c/. chap. VII, 
relation (C-39-a) |: 


Ri off) = 2(a,)7*? E (18) 


where a, is obtained by setting Z = 1 in (14)]. Formula (10) then gives: 
E 


z 2 e? (po > 4 Po\ 
AE o E” (2) -E(f (19) 


Now, we know that, for hydrogen : 


dy =~ 0.53 Å = 5.3 x 107!!m (20) 
Furthermore, the radius p, of the proton is of the order of: 

polproton) = 1 F = 10°'Sm (21) 
If we substitute these numerical values into (19), we obtain: 

AE o = 4.5 x 10 "E ~6 x 10 %eV (22) 


The result is therefore very small. 

For a hydrogen-like ion, the nucleus has a charge of — Zq. We can then 
apply (10), which amounts to replacing e? in (19) by Ze?, and a, by a.(Z) = ao/Z. 
We obtain: 


2 Ze? [pA Z i 
AE, (Z) = soe | ee?) x z| (23) 
0 0 


where pọ(4, Z) is the radius of the nucleus, composed of A nucleons (protons or 
neutrons), Z of which are protons. In practice, the number of nucleons of a 
nucleus is not very different from 2Z; in addition, the “nuclear density saturation” 
property is expressed by the approximate relation: 


pA, Z)x Alea Z!” (24) 


The variation of the energy correction with respect to Z is then given by : 


AB, o(Z) œ ZA (25) 
or: 

AE, o(Z) 83 

Elz) * Z (26) 
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AE, 9(Z) therefore varies very rapidly with Z, under the effect of several concordant 
factors: when Z increases, a, decreases and p, increases. The volume effect is 
therefore significantly larger for heavy hydrogen-like ions than for hydrogen. 


COMMENT: 


The volume effect also exists for all the other atoms. It is responsible for 
an isotopic shift of the lines of the emission spectrum. For two distinct isotopes 
of the same chemical element, the number Z of protons of the nucleus is the 
same, but the number A — Z of neutrons is different; the spatial distri- 
butions of the nuclear charges are therefore not identical for the two nuclei. 

Actually, for light atoms, the isotopic shift is caused principally by 
the nuclear finite mass effect (cf. complement Ayp. $1-a-x). On the other 
hand, for heavy atoms (for which the reduced mass varies very little from one 
isotope to another), the finite mass effect is small; however, the volume 
effect increases with Z and becomes preponderant. 


b. MUONIC ATOMS 


We have already discussed some simple properties of muonic atoms 
(cf. complement Ay. $4 and Ayp. 92-4). In particular, we have pointed out that 
the Bohr radius associated with them is distinctly smaller than for ordinary atoms 
(this is caused by the fact that the mass of the y” muon is approximately equal 
to 207 times that of the electron). From the qualitative discussion of $1-b. we may 
therefore expect an important volume effect for muonic atoms. We shall evaluate 
it by choosing two limiting cases: a light muonic atom (hydrogen) and a heavy 
one (lead). 


o. The muonic hydrogen atom 


The Bohr radius is then: 


_ a 
au ,p* = 207 (27) 


that is, of the order of 250 fermi. It therefore remains, in this case, distinctly greater 
than pọ. If we replace a, by a,/207 in (19), we find: 
AE, (u.p')=~1.9 x 10° x Elu ,p*)=5 x 10? eV (28) 


Although the volume effect is much larger than for the ordinary hydrogen atom, 
it still yields only a small correction to the energy levels. 


B. The muonic lead atom 

The Bohr radius of the muonic lead atom is [cf. complement Ay, rela- 
tion (25)]: 

aolu”, Pb) =3F = 3 x 10° ‘Sm (29) 


The y” muon is now very close to the lead nucleus; it is therefore practically 
unaffected by the repulsion of the atomic electrons which are located at distinctly 
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greater distances. This could lead us to believe that (10), which was proven for 
hydrogen-like atoms and ions, is directly applicable to this case. Actually, this is not 
true, since the radius of the lead nucleus is equal to: 


po(Pb) = 8.5 F = 8.5 x 10715 m (30) 


which is not small compared to «s(u”, Pb). Equation (10) would therefore lead 
to large corrections (several MeV), of the same order of magnitude as the 
energy E,(u”, Pb). We therefore see that, in this case, the volume effect can no 
longer be treated as a perturbation (see discussion of $4 of complement Ay). 
To calculate the energy levels, it is necessary to know the potential V(r) exactly 
and to solve the corresponding Schródinger equation. 

The muon is therefore more inside the nucleus than outside, that is, 
according to (1), in a region in which the potential is parabolic. In a first 
approximation, we could consider the potential to be parabolic everywhere (as is 
done in complement Ay) and then treat as a perturbation the difference which 
exists for r > po between the real potential and the parabolic potential. However, 
the extension of the wave function corresponding to such a potential is not 
sufficiently smaller than p, for such an approximation to lead to precise results, 
and the only valid method consists of solving the Schródinger equation correspon- 
ding to the real potential. 


References and suggestions for further reading: 


The isotopic volume effect: Kuhn (11.1), chap. VI, §C-3; Sobel'man (11.12), 
chap. 6, 324. 

Muonic atoms (sometimes called mesic atoms): Cagnac and Pebay-Peyroula 
(11.2), chap. XIX, §7-C; De Benedetti (11.21); Wiegand (11.22); Weissenberg (16.19), 
y 42, 
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THE VARIATIONAL METHOD 


I. Principle of the method 

a. A property of the ground state of a system 

b. Generalization: the Ritz theorem 

c. A special case where the trial functions form a subspace 
2. Application to a simple example 

a. Exponential trial functions 

b. Rational trial functions 


3. Discussion 


The perturbation theory studied in chapter XI is not the only general approximation 
method applicable to conservative systems. We shall give a concise description 
here of another of these methods, which also has numerous applications, especially 
in atomic and molecular physics, nuclear physics, and solid state physics. First 
of all, we shall indicate, in $1, the principle of the variational method. Then 
we shall use the simple example of the one-dimensional harmonic oscillator to 
bring out its principal features (§2), which we shall briefly discuss in $3. Comple- 
ments Fx, and Gx, apply the variational method to simple models which enable 
us to understand the behavior of electrons in a solid and the chemical bond. 


1. Principle of the method 


Consider an arbitrary physical system whose Hamiltonian H is time- 
independent. To simplify the notation, we shall assume that the entire spectrum of H 
is discrete and non-degenerate : 


H | @, > = E, | o >in = 0, 1, 2, (1) 


Although the Hamiltonian H is known, this is not necessarily the case for its 
eigenvalues E, and the corresponding eigenstates | p, >. The variational method is, 
of course, most useful in the cases in which we do not know how to diagonalize H 
exactly. 


a. A PROPERTY OF THE GROUND STATE OF A SYSTEM 


Choose an arbitrary ket | y > of the state space of the system. The mean 
value of the Hamiltonian H in the state | Y > is such that: 


my > g, (2) 


(where E, is the smallest eigenvalue of H), equality occuring if and only if | y > 
is an eigenvector of H with the eigenvalue Fy. 
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To prove inequality (2), we expand the ket | y > on the basis of eigenstates 
of H: 


ly>= dial, > (3) 


n 


We then have: 

Cw] HW > =D lc, En > Eo 2 len? (4) 
with. of course: 

SARA (5) 


which proves (2). For inequality (4) to become an equality, it is necessary and 
sufficient that all the coefficients c, be zero, with the exception of co; | y > is then 
an eigenvector of H with the eigenvalue E,. 

This property is the basis for a method of approximate determination of Ep. 
We choose (in theory, arbitrarily, but in fact, by using physical criteria) a family 
of kets | y(x) which depend on a certain number of parameters which we 
symbolize by æ. We calculate the mean value < H >(a) of the Hamiltonian H in 
these states, and we minimize < H >(a) with respect to the parameters a. The 
minimal value so obtained constitutes an approximation of the ground state E, 
of the system. The kets | w(a)> are called trial kets, and the method itself, the 
variational method. 


COMMENT: 


The preceding proof can easily be generalized to cases in which the 
spectrum of H is degenerate or includes a continuous part. 


b. GENERALIZATION: THE RITZ THEOREM 


We shall show that, more generally, the mean value of the Hamiltonian H is 
Stationary in the neighborhood of its discrete eigenvalues. 
Consider the mean value of H in the state | y >: 


_ JH W> 
== TO le 


as a functional of the state vector | y >, and calculate its increment ô< H > when | y > 
becomes | y > + | dw >, where | dy > is assumed to be infinitely small. To do so, 
it is useful to write (6) in the form: 


<H>< y ly>=<yw|H|y> (7) 
and to differentiate both sides of this relation : 
wld H) +H [Kw lop) + (dW |W] (8) 


=<p|H|dv> + <dv|A|w>d 
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that is, since < H > is a number: 


<wlw>o HY 
=(w|[H-<H)>]|o¥>+<dW|[H-<H)>]|¥> (00) 


The mean value < H > will be stationary if: 
6H) =0 (10) 


which, according to (9), means that: 


(y |[H =<H>31]94> + (dw |LH- <A>] |¥> =90 (11) 
We set: 

lo>=[H-<H)>]|¥> (12) 
Relation (11) can then be written simply: 

<p l|dw> + <d¥| p> =9 (13) 


This last relation must be satisfied for any infinitesimal ket | dy >. In particular, 
if we choose: 


[öy > = dA] > (14) 
(where 6A is an infinitely small real number), (13) becomes: 
2<p|p>014=0 (15) 


The norm of the ket | y > is therefore zero, and | ~ > must consequently be zero. 
With definition (12) taken into account, this means that: 


H|W>=<H)|w> (16) 


Consequently, the mean value < H > is stationary if and only if the state vector | y > 
to which it corresponds is an eigenvector of H, and the stationary values of < H > 
are the eigenvalues of the Hamiltonian. 

The variational method can therefore be generalized and applied to the 
approximate determination of the eigenvalues of the Hamiltonian H. If the 
function < H X(x) obtained from the trial kets | W(x) > has several extrema, they give 
the approximate values of some of its energies E, (cf. exercise 10 of comple- 
ment Hy,). 


c. A SPECIAL CASE WHERE THE TRIAL FUNCTIONS FORM A SUBSPACE 


Assume that we choose for the trial kets the set of kets belonging to a vector 
subspace ¥ of &. In this case, the variational method reduces to the resolution 
of the eigenvalue equation of the Hamiltonian H inside F, and no longer in all of £. 

To see this, we simply apply the argument of $ 1-b, limiting it to the kets | yw > 
of the subspace 4. The maxima and minima of < H >, characterized by < H > = 0, 
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are obtained when | y > is an eigenvector of H in F. The corresponding eigenvalues 
constitute the variational method approximation for the true eigenvalues of H in €. 

We stress the fact that the restriction of the eigenvalue equation of H to a 
subspace ¥ of the state space & can considerably simplify its solution. However, 
if F is badly chosen, it can also yield results which are rather far from the true 
eigenvalues and eigenvectors of H in & (cf. $ 3). The subspace F must therefore be 
chosen so as to simplify the problem enough to make it soluble, without too 
greatly altering the physical reality. In certain cases, it is possible to reduce the 
study of a complex system to that of a two-level system (cf. chap. IV), or at least, 
to that of a system of a limited number of levels. Another important example of this 
procedure is the method of the linear combination of atomic orbitals, widely used in 
molecular physics. This method consists essentially (cf. complement G,,) of the 
determination of the wave functions of electrons in a molecule in the form of 
linear combinations of eigenfunctions associated with the various atoms which 
constitute the molecule, treated as if they were isolated. It therefore limits the 
search for the molecular states to a subspace chosen using physical criteria. 
Similarly, in complement F,,;, we shall choose as a trial wave function for an 
electron in a solid a linear combination of atomic orbitals relative to the various 
ions which constitute this solid. 


COMMENT: 


Note that first-order perturbation theory fits into this special case of the 
variational method: ¥ is then an eigensubspace of the unperturbed 
Hamiltonian H,. 


2. Application to a simple example 


To illustrate the discussion of §1 and to give an idea of the validity of the 
approximations obtained with the help of the variational method, we shall apply 
this method to the one-dimensional harmonic oscillator, whose eigenvalues and 
eigenstates we know (cf. chap. V). We shall consider the Hamiltonian: 


(17) 
and we shall solve its eigenvalue equation approximately by variational calculations. 


a. EXPONENTIAL TRIAL FUNCTIONS 


Since the Hamiltonian (17) is even, it can easily be shown that its ground state 
is necessarily represented by an even wave function. To determine the charac- 
teristics of this ground state, we shall therefore choose even trial functions. We 
take, for example, the one-parameter family : 


W(x) = ec" ; a>0 (18) 


1151 


www.elsolucionario.net 
COMPLEMENT Ex; 


The square of the norm of the ket | y, > is equal to: 


(volver =| aver (19) 
and we find : 
< pal H| ya) = dx e7% TA - += mors? | ga 
e 2m dx* 
h? l 2 l E — 2ax? 
= E a tg mw ;| | A dx e (20) 
so that : 
oa ieee 21 
~ 2m 8 ol (21) 


The derivative of the function < H X(x) goes to zero for: 


l mo 
& = % = 53h (22) 
and we then have: 
< H dla.) = a (23) 


2 


The minimum value of < H >(a) is therefore exactly equal to the energy of the 
ground state of the harmonic oscillator. This result is due to the simplicity of the 
problem that we are studying: the wave function of the ground state happens to 
be precisely one of the functions of the trial family (18). the one which corresponds 
to value (22) of the parameter a. The variational method, in this case, gives the 
exact solution of the problem (this illustrates the theorem proven in $1-a). 

If we want to calculate (approximately, in theory) the first excited state E, of 
the Hamiltonian (17), we should choose trial functions which are orthogonal to 
the wave function of the ground state. This follows from the discussion of § l-a, 
which shows that < H > has a lower bound of £,, and no longer of Eo, if the 
coefficient c} is zero. We therefore choose the trial family of odd functions: 


2 


ee (24) 
In this case: 
+ % 
al a> -Í dx x? e720? es 
and: 
CUO A a a 2 a- 2ax? 26) 
i a 2m 2 D Ao F x xe ( 
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which yields : 


3h? 3 41 
< H la) = 5z +3 mo ` (27) 
This function, for the same value a, as above [formula (22)], presents a minimum 


equal to: 
3 
<H la) = 5 how (28) 


Here again, we find exactly the energy E, and the associated eigenstate because the 
trial family includes the correct wave function. 


b. RATIONAL WAVE FUNCTIONS 


The calculations of $2-a enabled us to familiarize ourselves with the variational 
method, but they do not really allow us to judge its effectiveness as a method 
of approximation, since the families chosen always included the exact wave 
function. Therefore. we shall now choose trial functions of a totally different type, 
for example* : 

l 


T? > a>0 (29) 


A simple calculation then yields: 


a dx n 
lv = f ras (30) 
-p +a Zaa 
and, finally : 
hli Oe 

< H yla) = 2 3 +5 mo'a (31) 
The minimum value of this function is obtained for: 

a Sulg = oe (32) 

J/2 mo) 

and is equal to: 

ASA (33) 


y2 


This minimum value is therefore equal to x 2 times the exact ground state 
energy ħw/2. To measure the error committed, we can calculate the ratio of 
< H »(a,) — ħw/2 to the energy quantum hw: 


l 
(Hyla) -30 7 
2 2 F 1 o 
ho = 2 = 2) %o (34) 


* Our choice here is dictated by the fact that we want the necessary integrals to be analytically 
calculable. Of course. in most real cases, one resorts to numerical integration. 
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3. Discussion 


The example of $2-b shows that it is easy to obtain the ground state energy 
of a system, without significant error, starting with arbitrarily chosen trial kets. 
This is one of the principal advantages of the variational method. Since the exact 
eigenvalue is a minimum of the mean value < H >, it is not surprising that < H > does 
not vary very much near this minimum. 

On the other hand, as the same reasoning shows, the “approximate” state 
can be rather different from the true eigenstate. Thus, in the example of $2-b, the 
wave function 1/(x? + ao) [where a, is given by formula (32)] decreases too 
rapidly for small values of x and much too slowly when x becomes large. Table I 
gives quantitative support for this qualitative assertion. It gives, for various values 
of x?, the values of the exact normalized eigenfunction : 


polx) = (20%9/n)!/* e71 


[where a, was defined in (22)] and of the approximate normalized eigenfunction: 


h D 3/4 h 
Elay) PO eras) 
ya Vax? +a Na 1 + 24201? 
x | (2) + ; (= 
0 NN ox [voto 
i | n 71424 2apx? 
0 | 0.893 1.034 
1/2 0.696 0.605 
1 0.329 0.270 
3/2 0.094 0.140 
2 0.016 0.083 
5/2 0.002 0.055 
3 0.000 1 | 0.039 


TABLE I 


It is therefore necessary to be very careful when physical properties other 
than the energy of the system are calculated using the approximate state obtained 
from the variational method. The validity of the result obtained varies enormously 
depending on the physical quantity under consideration. In the particular problem 
which we are studying here, we find, for example, that the approximate mean value 
of the operator X?* is not very different from the exact value: 


< Wao X’ | Wag > 1 A (36) 


Wool Van > Ja MO 


* The mean value of X is automatically zero, as is correct since we have chosen even trial functions. 
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which is to be compared with h/2mw. On the other hand, the mean value of X* is 
infinite for the wave function (35), while it is, of course, finite for the real wave 
function. More generally, table I shows that the approximation will be very poor 
for all properties which depend strongly on the behavior of the wave function 
for x Z 2/V Ap. 

The drawback we have just mentioned is all the more serious as it is very 
difficult, if not impossible, to evaluate the error in a variational calculation if we 
do not know the exact solution of the problem (and, of course, if we use the 
variational method, it is because we do not know this exact solution). 

The variational method is therefore a very flexible approximation method, 
which can be adapted to very diverse situations and which gives great scope to 
physical intuition in the choice of trial kets. It gives good values for the energy 
rather easily, but the approximate state vectors may present certain completely 
unpredictable erroneous features, and we cannot check these errors. This method is 
particularly valuable when physical arguments give us an idea of the qualitative 
or semi-quantitative form of the solutions. 


References and suggestions for further reading: 


The Hartree-Fock method, often used in physics, is an application of the varia- 
tional method. See references of chapter XT. 

The variational method is of fundamental importance in molecular physics. 
See references of complement Gy. 

For a simple presentation of the use of variational principles in physics, 
see Feynman II (7.2), chap. 19, 
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Complement Fy, 


ENERGY BANDS OF ELECTRONS IN SOLIDS : 
A SIMPLE MODEL 


1. A first approach to the problem: qualitative discussion 
2. A more precise study using a simple model 
a. Calculation of the energies and stationary states 
a. Description of the model; simplifying hypotheses 
B. Possible energies; the concept of an energy band 
y. Stationary states; Bloch functions 
ò. Periodic boundary conditions 


b. Discussion 


A crystal is composed of atoms evenly distributed in space so as to form a three- 
dimensional periodic lattice. The theoretical study of the properties of a crystal, 
which brings into play an extremely large number of particles (nuclei and electrons), 
poses a problem which is so complicated that it is out of the question to treat it 
rigorously. We must therefore resort to approximations. 

The first of these is of the same type as the Born-Oppenheimer approximation 
(which we encountered in $ 1 of complement Ay). It consists of considering, first 
of all, the positions of the nuclei as fixed, which enables us to study the stationary 
states of the electrons subjected to the potential created by the nuclei. The motion 
of the nuclei is not treated until later, using the knowledge of the electronic 
energies*. In this complement, we shall concern ourselves only with the first step 
of this calculation, and we shall assume the nuclei to be motionless at the nodes of 
the crystalline lattice. 

This problem still remains extremely complicated. It is necessary to calculate 
the energies of a system of electrons subjected to a periodic potential and interacting 
with each other. We then make a second approximation : we assume that each 
electron, at a position r;,, is subjected to the influence of a potential V(r;) which 
takes into account the attraction exerted by the nuclei and the average effect of the 
repulsion of all the other electrons**. The problem is thus reduced to one involving 
independent particles, moving in a potential which has the periodicity of the 
crystalline lattice. 

The physical characteristics of a crystal therefore depend, in a first approxima- 
tion, on the behavior of independent electrons subjected to a periodic potential. 
We could be led to think that each electron remains bound to a given nucleus, 
as happens in isolated atoms. We shall see that, in reality. the situation is completely 


* Recall that the study of the motion of the nuclei leads to the introduction of the normal 
vibrational modes of the crystal: phonons (cf. complement Jy). 

** This approximation is of the same type as the “central field” approximation for isolated 
atoms (cf. complement Ax y. $1). 
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different. Even if an electron is initially in the neighborhood of a particular nucleus, 
it can move into the zone of attraction of an adjacent nucleus by the tunnel effect, 
then into another, and so on. Actually, the stationary states of the electrons are 
not localized in the neighborhood of any nucleus, but are completely delocalized : 
the probability density associated with them is uniformly distributed over all the 
nuclei*. Thus, the properties of an electron placed in a periodic potential resemble 
those of an electron free to move throughout the crystal more than they do those 
of an electron bound to a particular atom. Such a phenomenon could not exist in 
classical mechanics: the direction of a particle traveling through a crystal would 
change constantly under the influence of the potential variations (for example, 
upon skirting an ion). In quantum mechanics, the interference of the waves 
scattered by the different nuclei permit the propagation of an electron inside the 
crystal. 

In $1, we shall study very qualitatively how the energy levels of isolated 
atoms are modified when they are brought gradually closer together to form a linear 
chain. Then, in §2. still confining ourselves, for simplicity, to the case of a 
linear chain, we shall calculate the energies and wave functions of stationary states 
a little more precisely. We shall perform the calculation in the “strong bonding 
approximation” : when the electron is in one site, it can move to one of two neigh- 
boring sites via the tunnel effect. The strong bonding approximation is equivalent 
to assuming that the probability of its tunneling is small. We shall, in this way, 
establish a certain number of results (the delocalization of stationary states, the 
appearance of allowed and forbidden energy bands, the form of Bloch functions) 
which remain valid in more realistic models (three-dimensional crystals, bonds of 
arbitrary strength). 

The “perturbation” approach which we shall adopt here constructs the 
stationary states of the electrons from atomic wave functions localized about the 
various ions. It has the advantage of showing how atomic levels change gradually 
to energy bands in a solid. Note, however, that the existence of energy bands can be 
directly established from the periodic nature of the structure in which the electron 
is placed (see, for example, complement O,,,, in which we study quantization of the 
energy levels in a one-dimensional periodic potential). 

Finally, we stress the fact that we are concerned here only with the properties 
of the individual stationary states of the electrons. To construct the stationary 
state of a system of N electrons from these individual states, it is necessary to apply 
the symmetrization postulate (cf. chap. XIV), since we are dealing with a system 
of identical particles. We shall treat this problem again in complement Cy y, when 
we shall describe the spectacular consequences of Pauli’s exclusion principle on the 
physical behavior of the electrons in a solid. 


* This phenomenon is analogous to the one we encountered in the study of the ammonia molecule 
(cf. complement G,y). There, since the nitrogen atom can move from one side of the plane of the hydrogen 
atoms to the other, by the tunnel effect. the stationary states give an equal probability of finding it in 
cach of the two corresponding positions. 
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1. A first approach to the problem: qualitative discussion 


We return to the example of the ionized H molecule, studied in $ C-2-c 
and C-3-d of chapter IV. Consider, therefore, two protons P, and P,, whose 
positions are fixed, and an electron which is subject to their electrostatic attraction. 
This electron sees a potential V(r). which has the form indicated in figure |. In terms 
of the distance R between P, and P, (considered as a parameter) what are the 
possible energies and the corresponding stationary states? 


V(x) 


\ |. 


FIGURE 1 


The potential seen by the electron in the ionized H} molecule as it moves along the Ox axis defined 
by the two protons. We obtain two wells separated by a barrier. If, at any instant, the electron is 
localized in one of the two wells, it can move into the other well via the tunnel effect. 


We shall begin by considering the limiting case in which R > a, (where a, is 
the Bohr radius of the hydrogen atom). The ground state is then two-fold 
degenerate : the electron can form a hydrogen atom either with P, or with P,; 
it is practically unaffected by the attraction of the other proton, which is very far 
away. In other words, the coupling between the states | p, > and | p, > considered 
in chapter IV (localized states in the neighborhood of P, or P,; cf. fig. 13 of 
chapter IV) is then negligible, so that |g, > and | p, > are practically stationary 
states. 

If we now choose a value of R comparable to a,, it is no longer possible to 
neglect the attraction of one or the other of the protons. If, at £ = 0, the electron 
is localized in the neighborhood of one of them, and even if its energy is lower 
than the height of the potential barrier situated between P, and P, (cf. fig. 1), it can 
move to the other proton by the tunnel effect. In chapter IV we studied the effect of 
coupling of the states | p, > and | p, >, and we showed that it produces an oscillation 
of the system between these two states (the dynamical aspect). We have also seen 
(the static aspect) that this coupling removes the degeneracy of the ground state 
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and that the corresponding stationary states are “delocalized” (for these states, the 
probability of finding the electron in the neighborhood of P, or P, is the same). 
Figure 2 shows the form of the variation with respect to R of the possible energies 
of the system*. 

Two effects appear when we decrease the distance R between P, and P,. On 
the one hand, an R = œ energy value gives rise to two distinct energies when R 
decreases (when the distance R is fixed at a given value R,, the stronger the coupling 
between the states | p} > and | y, >, the greater the difference between these two 
energies). On the other hand, the stationary states are delocalized. 

It is easy to imagine what will happen if the electron is subject to the influence, 
not of two, but of three identical attractive particles (protons or positive ions), 
arranged, for example, in a straight line at intervals of R. When R is very large, the 
energy levels are triply degenerate, and the stationary states of the electron can be 
chosen to be localized in the neighborhood of any one of the fixed particles. 


FIGURE 2 


Variation of the energy of stationary 
states of the electron in terms of the 
distance R between the two protons of 


the H} ion. When R is large, there are 
two practically degenerate states, of 
energy — £,. When R decreases, this 


degeneracy is removed. The smaller R, 
the greater the splitting. 


If R is decreased, each energy gives rise to three generally distinct energies and, 
in a stationary state, the probabilities of finding the electron in the three wells are 
comparable. Moreover, if, at the initial instant, the electron is localized in the 
right-hand well, for example, it moves into the other wells during its subsequent 
evolution**, 

The same ideas remain valid for a chain composed of an arbitrary number . Y 
of ions which attract an electron. The potential seen by the electron is then 
composed of M regularly spaced identical wells (in the limit in which M —== æ, 
it is a periodic potential). When the distance R between the ions is large, the energy 
levels are -fold degenerate. This degeneracy disappears if the ions are moved 


* A detailed study of the H} ion is presented in complement Gx. 
** See exercise 8 of complement J,,. 
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closer together: each level gives rise to distinct levels, which are distributed, as 
shown in figure 3, in an energy interval of width 4. What now happens if the value 
of N is very large? In each of the intervals 4, the possible energies are so close that 
they practically form a continuum: “allowed energy bands” are thus obtained, 
separated by “forbidden bands”. Each allowed band contains . Y” levels (actually 
2.1” if the electron spin is taken into account). The stronger the coupling causing 
the electron to pass from one potential well to the next one, the greater the band 
width. (Consequentlv. we expect the lowest encrey bands to be the narrowest since 
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FIGURE 3 


Energy levels of an electron subject to the action of ./ regularly spaced identical ions. When £R is 
very large, the wave functions are localized about the various ions, and the energy levels are the 
atomic levels, ./-fold degenerate (the electron can form an atom with any one of the ~ ions). 
In the figure, two of these levels are shown, of energies — E and — E’. When R decreases, the 
electron can pass from one ion to another by the tunnel effect, and the degeneracy of the levels is 
removed. The smaller R, the greater the splitting. For the value R, of R found in a crystal, each of 
the two original atomic levels is therefore broken down into ./ very close levels. If ./ is very large, 
these levels are so close that they yield energy bands, of widths 4 and A’, separated by a forbidden 
band. 
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the tunnel effect which is responsible for this passage is less probable when the 
energy is smaller). The stationary states of the electron are all delocalized. The 
analogue here of figure 3 of complement My, is figure 4, which represents the 
energy levels and gives an idea of the spatial extension of the associated wave 
functions. 


V(x) 

x 
| | | 
| | ! | | 

EN | | | | | 
at e a a 5 y 
— =a = w] 

| pis a | 

| 

at ES SS ES == 

| | 

| | | 

| | 

| | 

FIGURE 4 


Energy levels for a potential composed of several regularly spaced wells. Two bands are shown in 
this figure, one of width 4 and the other of width 1’. The deeper the band, the more narrow it is, 
since crossing the barrier by the tunnel effect is then more difficult. 


Finally, note that if, at £ = 0, the electron is localized at one end of the chain, 
it propagates along the chain during its subsequent evolution. 


2. A more precise study using a simple model 


a. CALCULATION OF THE ENERGIES AND STATIONARY STATES 


To complete the qualitative considerations of the preceding section, we 
shall discuss the problem more precisely, using a simple model. We shall perform 
calculations analogous to those of §C of chapter IV, but adapted to the case in 
which the system under consideration contains an infinite number of ions (instead 
of two), regularly spaced in a linear chain. 
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a. Description of the model; simplifying hypotheses 


Consider, therefore, an infinite linear chain of regularly spaced positive ions. 
As in chapter IV, we shall assume that the electron, when it is bound to a given ion, 
has only one possible state: we shall denote by | v, > the state of the electron when 
it forms an atom with the nth ion of the chain. For the sake of simplicity, we shall 
neglect the mutual overlap of the wave functions v,(x) associated with neighboring 
atoms, and we shall assume the { | v, > } basis to be orthonormal: 


< ty | 0p > = Sup (1) 


Moreover, we shall confine ourselves to the subspace of the state space spanned 
by the kets | v, >. It is obvious that by restricting the state space accessible to the 
electron in this way, we are making an approximation. This can be justified by using 
the variational method (cf. complement E,,,) : by diagonalizing the Hamiltonian H, 
not in the total space, but in the one spanned by the | v, >, it can be shown that we 
obtain a good approximation for the true energies of the electron. 

We shall now write the matrix representing the Hamiltonian H in the { | v, > } 
basis. Since the ions all play equivalent roles, the matrix elements < v, | H | o, > are 
necessarily all equal to the same energy E,. In addition to these diagonal elements, 
H also has non-diagonal elements < vo, | H | v, > (coupling between the various 
states |v, >, which expresses the possibility for an electron to move from one 
ion to another). This coupling is obviously very weak for distant ions; this is why 
we shall take into account only the matrix elements < v, | H | 0,,, >, which we 
shall choose equal to a real constant — A. Under these conditions, the (infinite) 
matrix which represents H can be written: 


—A Eo — A 0 
(H) = y (2) 


0 0 -A E, 


To find the possible energies and the corresponding stationary states, we must 
diagonalize this matrix. 


B. Possible energies ; the concept of an energy band 
Let | y > be an eigenvector of H; we shall write it in the form: 
+00 
lp>= Y co) (3) 
q= 00 
Using (2), the eigenvalue equation: 


Hlp9>=El0Q> (4) 
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projected onto | v, >, yields: 
Eyty= A G+ Ati Et, (5) 


When q takes on all positive or negative integral values, we thus obtain an 
infinite system of coupled linear equations which, in certain ways, recall the coupled 
equations (5) of complement Jų. As in that complement, we shall look for simple 
solutions of the form: 


pee (6) 


where / is the distance between two adjacent ions, and k is a constant whose 
dimensions are those of an inverse length. We require k to belong to the “first 
Brillouin zone”, that is, to satisfy : 


-F<k< +5 (7) 


This is always possible, because two values of k differing by 2x// give all the 
coefficients c, the same value. Substituting (6) into (5), we obtain: 


Eb eikal oe A[e kar im + eittia- 1) Z E pika! (8) 
that is, dividing by e***!: 
E = E(k) = E, — 2A cos kl (9) 


If this condition is satisfied, the ket | y > given by (3) and (6) is an eigenket of H ; 
its energy depends on the parameter k, as is indicated by (9). 

Figure 5 represents the variation of E with respect to k. It shows that the 
possible energies are situated in the interval [£, — 24, Ey + 2A]. We therefore 
obtain an allowed energy band, whose width 44 is proportional to the strength of the 
coupling. 


FIGURE 5 


Possible energies of the electron in 
terms of the parameter k (k varies 
within the first Brillouin zone). An 
energy band therefore appears, with 
a width 44 which is proportional to 

i i the coupling between neighbouring 

— ail 0 + afl atoms. 


IE, — 2A 
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Y. Stationary states ; Bloch functions 


Let us calculate the wave function p,(x) = < x |p, > associated with the 
stationary state | p, > of energy E(k). Relations (3) and (6) give: 


lo>= Y eto) (10-a) 
that is: o 

ax) = E et n(x) (10-b) 
where: o 

p(x) = <x | 0, > (11) 


is the wave function associated with the state | v, >. Since the state | v, > can be 
obtained from the state | vo > by a translation of ql. we have: 


v(x) = volx — ql) (12) 
so that (10-b) can be written: 
+0 


ox) = Y e v(x — ql) (13) 


q=-% 


We now calculate g(x + /): 


+ 00 
ole tN = E olx (9-1 
q=-w0 
` tao . 
= el Y e ol — (q ~ 1] 
q=-> Ww 
= e™ 9,(x) (14) 
To express this remarkable property simply, we set: 
px) = e™ u(x) (15) 
The function .,(x) so defined then satisfies: 
u,(x + 1) = m(x) (16) 


tkx 


Therefore, the wave function @,(x) is the product of e'** and a periodic function 
which has the period / of the lattice. A function of type (15) is called a Bloch 
function. Note that, if n is any integer: 


lox + ni)? = lol)? (17) 


a result which demonstrates the delocalization of the electron : the probability 
density of finding the electron at any point on the x-axis is a periodic function of x. 


COMMENT: 


Expressions (15) and (16) have been proven here for a simple model. Actually, 
this result is more general and can be proven directly from the symmetries of the 
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Hamiltonian H (Bloch’s theorem). To show this, let us call S(a) the unitary operator 
associated with a translation of a along Ox (cf. complement £,,, $3). Since the system 
is invariant under any translation which leaves the ion chain unchanged, we must have : 
[H, S(1)] = 0 (18-a) 


We can therefore construct a basis of eigenvectors common to the operator S(/) and H. 
Now, equation (14) is simply the one which defines the eigenfunctions of S(— /) [since 
this operator is unitary, its eigenvalues can always be written in the form e*!', where k 
satisfies condition (7); cf. complement C,,, $1-d]. It is then simple to get. as before. (15) 
and (16) from (14). 

Note that, for any a, we have, in general: 
[H. S(a)] #0 (18-b) 


unlike the situation of a free particle (or one subject to the influence of a constant 
potential). For a free particle, since H commutes with all operators S(a) (that is, with 
the momentum P,; cf. complement E. $ 3), the stationary wave functions are of the 
form: 


w(x) oc ef (19) 


The fact that, in our case, (18-b) is satisfied only for certain values of a explains why 
form (15) is less restrictive than (19). 


ò. Periodic boundary conditions 


To each value of k in the interval [ — z//, + x//[ therefore corresponds an 
eigenstate | g > of H, with the coefficients c, appearing in expansion (3) of | p > 
given by equation (6). We thus obtain an infinite continuum of stationary states. 
This is due to the fact that we have considered a linear chain containing an 
infinite number of ions. What happens when we consider a finite linear chain, of 
length L, composed of a large number .4" of ions? 

The qualitative considerations of $ 1 show that there must then be .4' levels 
in the band (2.+" if spin is taken into account). The exact determination of the 
corresponding . $` stationary states is a difficult problem, since it is necessary to 
take into account the boundary conditions at the ends of the chain. It is clear, 
however, that the behavior of electrons sufficiently far from the ends are little 
affected by the “edge effects”*. This is why one generally prefers, in solid state 
physics, to substitute for the real boundary conditions, new boundary conditions, 
which, despite their artificial character, have the advantage of leading to much 
simpler calculations, while conserving the most important properties necessary 
for the comprehension of effects other than the edge effects. 

These new boundary conditions, called periodic boundary conditions, or 
“Born-Von Karman conditions” (B.V.K. conditions). require the wave function 
to take on the same value at both ends of the chain. We can also ¡imagine that we are 
placing an infinite number of identical chains, all of length L, end to end. We then 


* For a three-dimensional crystal. this amounts to establishing a distinciion between ” bulk 
effects” and “surface effects”. 
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require the wave function of the electron to be periodic, with a period £. Equations (5) 
remain valid, as does their solution (6), but the periodicity of the wave function 
now implies : 


gre I (20) 
Consequently. the only possible values of k are of the form: 


2n 

k, = nT (21) 
where n is a positive or negative integer or zero. Let us now verify that the B. V.K. 
conditions give the correct result for the number of stationary states contained 
in the band. To do so, we must calculate the number of allowed values k, included 
in the first Brillouin zone. We obtain this number by dividing the width 2z// of this 
zone by the interval 27/£ between two adjacent values of k, which indeed 
gives us: 


==> = VW-12WV (22) 


r L 
L l 


We should also show that the .V stationary states obtained with the B.V.K. 
conditions are distributed in the allowed band with the same density* p(£) as the 
true stationary states (associated with the real boundary conditions). As the 
density of states p(E) plays a very important role in the comprehension of the 
physical properties of a solid (we shall discuss this point in complement Cy,y), 
it is important for the new boundary conditions to leave it unchanged. That 
the B.V.K. conditions give the correct density of states will be proven in 
complement Cy, ($1-c) for the simple example of a free electron gas enclosed 
in a “rigid box”. In this case, the true stationary states can be calculated and 
compared with those obtained by using the periodic boundary conditions on the 
walls of the box (see also $3 of complement O,,,). 


b. DISCUSSION 


Starting with a discrete non-degenerate level for an isolated atom (for 
example, the ground level) we have obtained a series of possible energies, grouped 
in an allowed band of width 44 for the chain of ions being considered. If we had 
started with another level of the atom (for example, the first excited level), we 
would have obtained another energy band, and so on. Each atomic level yields one 
energy band, as figure 6 shows, and there appears a series of allowed bands, 
separated by forbidden bands. 

Relation (6) shows that, for a stationary state, the probability amplitude of 
finding the electron in the state | v, > is an oscillating function of q, whose modulus 
does not depend on q. This recalls the properties of phonons, the normal vibra- 
tional modes of an infinite number of coupled oscillators for which all the oscillators 
participate in the collective vibration with the same amplitude, but with a certain 
phase shift (cf. complement Jy). 


* p(E) dE is the number of distinct stationary states with energies included between E and E + dE. 
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How can we obtain states in which the electron is not completely delocalized ? 


For a free electron, we saw in chapter I that we must superpose plane waves so as to 
form a free “wave packet”: 


Wx, t) = ms E ĝ(k) eilkx — Etoh) (23) 
V LT 
Allowed bands 
Y Y 
eS SS 
y nmn E 
FIGURE 6 


Allowed bands and forbidden bands on the energy axis. 


The maximum of this wave packet propagates at the group velocity (cf. chap I,§ C): 


A 1| dE hk 
lo H ea i) 
k = ko 


[where kọ is the value of k for which the function g(k) presents a peak]. Here, 
we must superpose wave functions of type (15), and the corresponding ket can be 
written : 


| W(t) > = se [ak glk) eF] o, > (25) 
T 


where g(k) is a function of k which has the form of a peak about k = ky. We shall 
calculate the probability amplitude of finding the electron in the state |v, >. 
Using (10-a) and (1), we can write: 


Cv, |) > = z= E g(k) esla! EUA T 
T 


Replacing ql by x in this relation, we obtain a function of x: 


E g(k) eilkx— Eckyh] (27) 


x(x, t) = 
on 


Only the values at the points x = 0, + ql, + 2gl, etc... of this function are really 
significant and yield the desired probability amplitudes. 

Relation (27) is entirely analogous to (23). By applying (24), it can be shown 
that y(x. 1) takes on significant values only in a limited domain of the x-axis whose 
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center moves at the velocity: 


It follows that the probability amplitude < v, | w(t) > is large only for certain 
values of g: therefore, the electron is no longer delocalized, but moves in the 
crystal at the velocity V¿ given by (28). 

Equation (9) enables us to calculate this velocity explicitly : 


2Al . 
Y = = sin Kol (29) 


This function is shown in figure (7). It is zero when k, = 0, that is, when the 
energy is minimal ; this is also a property of the free electron. However, when k takes 
on non-zero values, important departures from the behavior of a free electron 
occur. For example, as soon as ky > 2/2/, the group velocity is no longer an 
increasing function of the energy. It even goes to zero when ky = + 2// (at the 
borders of the first Brillouin zone). This indicates that an electron cannot move 
in the crystal if its energy is too close to the maximum value Fy + 2A appearing 
in figure 5. The optical analogy of this situation is Bragg reflection. X rays whose 
wavelength is equal to the unit edge of the crystalline lattice cannot propagate 
in it : interference of the waves scattered by each of the ions lead to total reflection. 


FIGURE 7 


Group velocity of the electron as 
a function of the parameter k. This 
velocity goes to zero, not only for 
k = 0 (as for the free electron), but 
also for k = + n/i (the edges of the 


first Brillouin zone). 


References and suggestions for further reading: 


Feynman III (1.2), chap. 13; Mott and Jones (13.7), chap. II, $4: references 
of section 13 of the bibliography. 
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A SIMPLE EXAMPLE OF THE CHEMICAL BOND: 
THE Hý ION 


l. Introduction 
a. General method 
b. Notation 
c. Principle of the exact calculation 
2. The variational calculation of the energies 
a. Choice of the trial kets 
b. The eigenvalue equation of the Hamiltonian H in the trial ket subspace F 
ce. Overlap, Coulomb and resonance integrals 
d. Bonding and antibonding states 


3. Critique of the preceding model. Possible tmprovements 


a. Results for small R 
b. Results for large R 


4, Other molecular orbitals of the H} ion 


a. Symmetries and quantum numbers. Spectroscopic notation 
b. Molecular orbitals constructed from the 2p atomic orbitals 


5, The origin of the chemical bond; the virial theorem 
a. Statement of the problem 
b. Some useful theorems 
c. The virial theorem applied to molecules 
d. Discussion 


1. Introduction 


In this complement, we intend to show how quantum mechanics enables us to 
understand the existence and properties of the chemical bond, which is responsible 
for the formation of more or less complex molecules from isolated atoms. Our 
aim is to explain the basic nature of these phenomena and not, of course, to enter into 
details which could only be covered in a specialized book on molecular physics. 
This is why we shall study the simplest molecule possible, the H} ion, which is 
composed of two protons and a single electron. We have already discussed certain 
aspects of this problem. in chapter FV ($C-2-c) and in exercise 5 of complement K,; 
we shell consider it here in a more realistic and systematic fashion. 


a. GENERAL METHOD 


When the two protons are very far from each other, the electron forms a 
hydrogen atom with one of them, and the other one remains isolated, in the 
form of an H* ion. If the two protons are brought closer together, the electron 
will be able to “jump” from one to the other. This radically modifies the situation 
(cf. chap. IV, $C-2). We shall therefore study the variation of the energies of the 
stationary states of the system with respect to the distance between the two protons. 
We shall see that the energy of the ground state reaches a minimum for a certain 
value of this distance, which explains the stability of the HZ molecule. 
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In order to treat the problem exactly, it would be necessary to write the 
Hamiltonian of the three-particle system and solve its eigenvalue equation. 
However, it is possible to simplify this problem considerably by using the Born- 
Oppenheimer approximation (cf. complement Ay, $1-a). Since the motion of the 
electron in the molecule is considerably more rapid than that of the protons, the 
latter can be neglected in a first approximation. The problem is then reduced to 
the resolution of the eigenvalue equation of the Hamiltonian of the electron 
subject to the attraction of two protons which are assumed to be fixed. In other 
words, the distance R between the two protons is treated, not like a quantum 
mechanical variable, but like a parameter, on which the electronic Hamiltonian 
and total energy of the system depend. 

In the case of the HZ ion, it so happens that the equation simplified in this 
way is exactly soluble for all values of R. However, this is not true for other, more 
complex, molecules. The variational method, described in complement E,,, must then 
be used. Although we are confining ourselves here to the study of the Hz ion, 
we shall use the variational method, since it can be generalized to the case of 
other molecules. 


b. NOTATION 


We shall call R the distance between the two protons, situated at P, and P,, 
and r, and r, the distances of the electron to each of the two protons (fig. 1). 
We shall relate these distances to a natural atomic unit, the Bohr radius a, 
(cf. chap. VII, § C-2), by setting: 


p = R/a, 
Pı = ri/ao P2 = F2/o (1) 


FIGURE 1 


We call r, the distance between the elec- 
tron (M) and proton P,, r,, the distance 
between the electron and proton P,, and 
P, P, R. the internuclear distance P, P,. 


The normalized wave function associated with the ground state 1s of the hydrogen 
atom formed around proton P, can then be written: 


1 
p = 
i vV nag 


Similarly, we express the energies in terms of the natural unit E,; = e?/2a,; 
E; is the ionization energy of the hydrogen atom. 


eTe (2) 
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It will sometimes be convenient in what follows to use a system of elliptic 
coordinates, in which a point M of space (here, the electron) is defined by: 
ri tra Py tP 
u == = H 
R p 


ir PP 


and the angle gy which fixes the orientation of the MP, P, plane about the P,P, axis 
(this angle also enters into the system of polar coordinates whose Oz axis coincides 
with P, P3). If we fix u and v, and if @ varies between 0 and 2z, the point M describes 
a circle about the P,P, axis. If u (or v) and @ are fixed, M describes an ellipse 
(or a hyperbola) of foci P, and P, when v (or u) varies. It can easily be shown 
that the volume element in this system is: 

R? 


d?r = = (u? — v?) du dv dy (4) 


To do so, we simply calculate the Jacobian J of the transformation: 
LX WZ) => (VU; (5) 


We see immediately that, if P,P, is chosen as the Oz axis, with the origin O in the middle 
of P,P: 


ll 
=x 
+ 
<= 
Ne 
+ 
os 
by 
| 
w| 
—< 


2 
ri 


tang =2 (6) 
x 


ĉ 1 fe er fx x x 
H Be ee ee 
(x R\e0x  €x Rn ry Fils 


(ved fer er vx 

ex R\ex ox) Fara 

Ch Y 

Cy Fr 

Cy _ vy 

Cy Ñ FP) 

ĉn 1 ¿RN 2 + Ri _ uz t+ vRA 
cz R ri rs FP) 

ev Pf2—R/2 z2+R/2] vz +uR/2 
ez: OR ri ra PP) 

a“ a A 
E y ton * V o (7) 
ôx x? + y? dy x+y? Cz 
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The Jacobian J can therefore be written: 


px by pz + vR/2 
1 1 
E > — vx — vy — vz — uR/2 |= a 2 — y?) (8) 
rr 
1x | = ye +y) xy 0 (rara) 
Since: 
dr, r 
ors a (9) 
we get, finally : 
8 
J =————— (10) 
R*(u? — v?) 
Cc. PRINCIPLE OF THE EXACT CALCULATION 


In the Born-Oppenheimer approximation, the equation to be solved in order 
to find the energy levels of the electron in the Coulomb field of the two fixed 
protons can be written: 


h2 e? e? e? 
o +& Joe) = Eo (11) 


If we go into the elliptical coordinates defined in (3), we can separate the variables y, 
v and q. Solving the equations so obtained, we find a discrete spectrum of possible 
energies for each value of R. We shall not perform this calculation here, but shall 
merely represent (the solid-line curve in figure 2) the variation of the ground state 
energy with respect to R. This will enable us to compare the results we shall obtain 
by the variational method with the values given by the exact solution of equation (11). 


2. The variational calculation cf the energies 
a. CHOICE OF THE TRIAL KETS 


Assume R to be much larger than a,. If we are concerned with values of r, 
of the order of ag, we have, practically: 


2 oe 
PS =F for R. r, > ao (12) 
The Hamiltonian: 
P? e e e? 
E E 
ds 2m ri ry R (13) 


is then very close to that of a hydrogen atom centered at proton P,. Analogous 
conclusions are, of course, obtained for R much larger than ag, and r, of the order 
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of ay. Therefore, when the two protons are very far apart, the eigenfunctions of the 
Hamiltonian (13) are practically the stationary wave functions of hydrogen atoms. 

This is, of course, no longer true when a, is not negligible compared to R. 
We see, however, that it is convenient, for all R, to choose a family of trial kets 
constructed from atomic states centered at each of the two protons. This choice 
constitutes the application to the special case of the H ion of a general method 
known as the method of linear combination of atomic orbitals. More precisely, we 


shall call | p, > and | y, > the kets which describe the ls states of the two hydrogen 
atoms: 


T 
p E 
. 


OF 


“ 


FIGURE 2 


Variation of the energy E of the molecular ion H} with respect to the distance R between the two 
protons. 


. Solid line : the exact total energy of the ground state (the stability of the Hj ion is due to the 
existence of a minimum in this curve). 

. dotted line: the diagonal matrix element H,, = H,, of the Hamiltonian H (the variation of 
this matrix element cannot explain the chemical bond). 

. dashed line : the results of the simple variational calculation of $2 for the bonding and antibonding 
states (though approximate, this calculation explains the stability of the H; ion). 

. triangles : the results of the more elaborate variational calculation of $ 3-a (taking atomic orbitals 
of adjustable radius considerably improves the accuracy, especially at small distances). 
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i 
Vra? 
1 


Vna 


We shall choose as trial kets all the kets belonging to the vector subspace F spanned 
by these two kets, that is, the set of kets | y > such that: 


e`’! 


<r|o,>= 


<r] e,> = ge (14) 


l¥>=c,]9,>+¢.|@2> (15) 


The variational method (complement E,,) then consists of finding the stationary 
values of : 


<Y|H|y> 


CF = oie 


(16) 


within this subspace. Since this is a vector subspace, the mean value < H > is minimal 
or maximal when | y > is an eigenvector of H inside this subspace .F, and the corres- 
ponding eigenvalue constitutes an approximation of a true eigenvalue of H in the 
total state space. 


b. THE EIGENVALUE EQUATION OF THE HAMILTONIAN H 
IN THE TRIAL KET VECTOR SUBSPACE # 


The resolution of the eigenvalue equation of H within the subspace F is 
slightly complicated by the fact that | g, > and | p, > are not orthogonal. 

Any vector | y > of ¥ is of the form (15). For it to be an eigenvector of H 
in ¥ with the eigenvalue E, it is necessary and sufficient that: 


Cg |H|W>=EX<@|¥> i=1,2 (17) 
that is: 

2 2 

MA RECAI (18) 
We set 

Sy =X<0,]0,> 

H,, = <9;|H|9;> (19) 


We must solve a system of two linear homogeneous equations: 


(H,, — ES,,)¢, + (H,, — ES,.)¢, = 0 
(H,, — ES) ci + (Hy, — ES,,)c, = 0 (20) 
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This system has a non-zero solution only if: 


Ay, = ES ¡y A,» = ES,, 
H3, = ES)»; H,, = ES), 


=0 (21) 
The possible eigenvalues E are therefore the roots of a second-degree equation. 


c. OVERLAP, COULOMB AND RESONANCE INTEGRALS 
| p, > and | p, > are normalized ; consequently : 
Si, =S,= 1 (22) 


On the other hand, |p, > and |p,> are not orthogonal. Since the wave- 
functions (14) associated with these two kets are real. we have: 


Si = Sa =S (23) 


with: 
Cole [erot oal (24) 


S is called an overlap integral, since it receives contributions only from points of 
space at which the atomic wave functions g, and @, are both different from 
zero (such points exist if the two atomic orbitals partially “ overlap”). A simple 
calculation gives : 


S= Sl +p +3] (25) 


To find this result. we can use elliptic coordinates (3). since: 


uty 
Pi = > P 
— y 
p == P (26) 


According to expression (14) for the wave functions and the one for the volume element, (4), we 
must calculate : 


1 +x +1 2n pa 
S -+f an f of dy — (u? — v?) e7*e 
NAD Y -1 0 8 


p { +x l 
A a G _ 5) ¿uo 27) 
2 j 3 


which easily yields (25). 
By symmetry: 


H,, = Ha, (28) 
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According to expression (13) for the Hamiltonian H, we obtain: 


p? 
Ha = <o |as- Eliad- cali la> +E Co as (29) 
P? e 
Now, | g, > is a normalized eigenket of m The first term of (29) is therefore 


equal to the energy — E, of the ground state of the hydrogen atom, and the third 
term is equal to e?/R; we thus have: 


2 


Hy = E +E-C (30) 
with: 
2 
= <Q, |— “|, >= fer < [o (1) 1? (31) 


C is called a Coulomb integral. It describes (to within a change of sign) the electro- 
static interaction between the proton P, and the charge distribution associated 
with the electron when it is in the Is atomic state around the proton P,. We find: 


Cains al e 1 + p)] (32) 


To find this result, we use elliptic coordinates again : 


2 1 3,3 
6 a lye dro Aas 
aop naz 8 py 
= E, p? | du | dv (u + v)e tve (33) 
1 -l 


Elementary integrations then lead to result (32). 


In formula (30), C can be considered to be a modification of the repulsive 
energy e?/R of the two protons: when the electron is in the state |g, >, the 
corresponding charge distribution “screens” the proton P,. Since |g,(r)|? is 
spherically symmetric about P,, if the proton P, was far enough from it this charge 
distribution would appear to P, like a negative point charge e situated at its center P,, 
(so that the charge of the proton P, would be totally cancelled). This does not 
actually happen unless R is much larger than ay: 


lim E — c| =0 (34) 


R> œ% 


For finite R, the screening effect can only be partial, and we must have: 


e? 


050 (35) 
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2 
The variation of the energy < — C with respect to R is shown in figure 2 by the 


dotted line. It is clear that the variation of H,, (or H,,) with respect to R cannot 
explain the chemical bond, since this curve has no minimum. 

Finally, let us calculate H,, and H,,. Since the wave functions  ,(r) and ¢,(r) 
are real, we have: 


Ay, = Hy, (36) 


Expression (13) for the Hamiltonian gives: 


P? e? e? e? 
Hs = Co. Ellos colo) cello) (37) 
that is, according to definition (24) of S: 
with: 
e? 3 e? 
A=< 017 lo: > = | dr pr) — olr) (39) 
1 1 


We shall call A the resonance integral*. It is equal to: 


A=E, x 2e (1 + p) (40) 


The use of elliptic coordinates enables us to write A in the form: 


2 1 3,3 2 -up 
A S| T T E 
ay ma, 8 (n + v)p 
= re, | du 2u e *P (41) 
1 


The fact that H,, is different from zero expresses the possibility of the 
electron “jumping” from the neighborhood of one of the protons to that of the 
other one. If, at some time, the electron is in the state | p, > (or | o, >), it oscillates 
in time between the two sites, under the influence of the non-diagonal matrix 
element H,,. H,, is therefore responsible for the phenomenon of quantum 
resonance, which we described qualitatively in $C-2-c of chapter IV (hence the 
name of integral 4). 

To sum up, the parameters which are functions of R and which are involved 
in equation (21) for the approximate energies £ are: 


Si, = Sa l 

S= Sa S a 

H,, = H, a} = — E, te 7 € 

H,,=H,, -(- E, rgs- a (42) 


* Certain authors call A an “exchange integral”. We prefer to restrict the use of this term to 
another type of integral which is encountered in many-particle systems (complement Byıy, $ 2-c-B). 
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where S, C and A are given by (25), (32) and (40), and are shown in figure 3. Note 
that the non-diagonal elements of determinant (21) take on significant values only 
if the orbitals g,(r) and qp,(r) partially overlap, since the product q,(r)p,(r) 
appears in definition (39) of A, as well as in that of S. 


de C ate 
ao 
2 
0,5 LE 
240 
pl ] “i i 2 i >p = — 
a 


FIGURE 3 


Variation of S (the overlap integral), C (the Coulomb integral) and 4 (the resonance integral) with 
respect to p = R/aọ. When R ——=> oo, S and A approach zero exponentially, while C decreases 


2 
only with ¢?/R (the “screened ” interaction A — C of the proton P, with the atom centered at P, 


also decreases exponentially, however). 


d. BONDING AND ANTIBONDING STATES 


A. Calculation of the approximate energies 
We set: 
E = sk, 
A= ak, (43) 
C= yE; 
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Equation (21) can then be written : 
iia cas Js fk Wig as 
p p 


(- 1 +Z)s-a—es oi ET | 


=0 (44) 


or: 


2 


E t(e+1-2)s| (45) 
p p 


This gives the following two values for e: 


ee 2 x — y 

ee eF (46-a) 
E z142- 2H 46-b 
os p 1+S ae) 


e, and ¢_ both approach — | when p approaches infinity. This means that the 
two approximate energies E, approach — E,, the ground state energy of an isolated 
hydrogen atom, as expected (§2-a). Furthermore, it is convenient to choose this 
value as the energy origin, that is, to set: 


AE = E({p) — Elo) = E + Ej (47) 


Using (25), (32) and (40), the approximate energies 4£, and AE_ can be written: 


2 
20 (1 +p)F=[1 — e (1 + p)] 
2 A (48) 


AE, =E,4—+ 
p 1 Fe (1 +p + p?/3) 


The variation of AE,/E, with respect to p is shown in dashed lines in figure 2. 
We see that AE_ has a negative minimum for a certain value of the distance R 
between the two protons. Although this is an approximation (cf. fig. 2), it explains 
the existence of the chemical bond. 

As we have already pointed out, the variation with respect to R of the 
diagonal elements H,, and H,, of determinant (21) has no minimum (dotted-line 
curve of figure 2). The minimum of A£_ therefore is due to the non-diagonal 
elements H,, and S,,. This shows that the phenomenon of the chemical bond 
appears only if the electronic orbitals of the two atoms participating in the 
bond overlap sufficiently. 


B. Eigenstates of H inside the subspace F 


The eigenstate corresponding to E_ is called a bonding state, and the one 
corresponding to E,, an antibonding state, since E, always remains greater than 
the energy — E, of the system formed by a hydrogen atom in the ground state 
and an infinitely distant proton. 
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According to (45): 


pro rt aa +(e +1-2)s] (49) 
p p 
System (20) then gives: 

c1ic,=0 (50) 


The bonding and antibonding states are therefore symmetrical and antisymmetrical 
linear combinations of the kets | g, > and | p, >. To normalize them, it must be 
recalled that |g, > and | py, > are not orthogonal (their scalar product is equal 
to S). We therefore obtain: 


lv, > =—— [lo > - 1029] (51-a) 
v(i — 5) 


ysi Hlo (51-b) 
Vv 2(1 + $) 


Note that the bonding state ly >, associated with E_, is symmetrical under 
exchange of | p, > and | p, >, while the antibonding state is antisymmetrical. 


COMMENT: 


It could have been expected that the eigenstates of H inside the 
subspace ¥ would be symmetrical and antisymmetrical combinations of | p, > 
and | P, >: for given positions of the two protons, there is symmetry with 
respect to the bisecting plane of P,P,, and H remains unchanged if the roles 
of the two protons are exchanged. 


The bonding and antibonding states are approximate stationary states of the 
system under study. We pointed out, furthermore, in complement E,,, that the 
variational method can give a valid approximation for the energies but gives a more 
debatable result for the eigenfunctions. It is instructive, however, to have an idea 
of the mechanism of the chemical bond, to represent graphically the wave 
functions associated with the bonding and antibonding states, which are often 
called bonding and antibonding molecular orbitals. To do so, we can, for example, 
trace the surfaces of equal |y] (the locus of points in space for which the modulus |y 
of the wave function has a given value). If y is real, we indicate by a + (or —) sign 
the regions in which it is positive (or negative). This is what is done in figure 4 
for y, and y _ (the surfaces of equal || are surfaces of revolution about the P,P, 
axis, and figure 4 only shows their cross sections in a plane containing P,P,). 
The difference between the bonding orbital and the antibonding orbital is striking. 
In the first one, the electronic cloud “streches out” to include both protons, while 
in the second one, the position probability of the electron is zero in the bisecting 
plane of P,P,. 


COMMENT: 


We can calculate the mean value of the potential energy in the state 
| Y _ >, which, if we use (51-b), (31) and (39), is equal to: 


1180 


www.elsolucionario.net 


A SIMPLE EXAMPLE OF THE CHEMICAL BOND: THE H; ION 


2 2 
ra EE liv 
e 


e e? š 
-k ra| alilo Sn ee? 


e? e? 
+<alZlo> tca 1Elo2>| 


2 1 
E -775l + 20 +a] (52) 
Subtracting this from (46-b), we obtain the kinetic energy : 


P? 
<T> =<? =<H-V) 


1 


=E 
114S 


(1 -S +a) (53) 


We shall discuss later (§ 5) to what extent (52) and (53) give good approxima- 
tions for the kinetic and potential energies. 


x 
© 
Q 
T 


FIGURE 4 


Schematic drawings of the bonding molecular orbital (fig. a) and the antibonding molecular orbital 
(fig. b) for the H] ion. We have shown the cross section in a plane containing P,P, of a family of 
surfaces for which the modulus |y| of the wave function has a constant given value. These are 
surfaces of revolution about P,P, (we have shown 4 surfaces, corresponding to 4 different values 
of |y|). The + and — signs indicated in the figure are those of the wave function (which is real) 
in the corresponding regions. The dashed line is the trace of the bisecting plane of P,P,, which is 
a nodal plane for the antibonding orbit. 


3. Critique of the preceding model. Possible improvements 
a. RESULTS FOR SMALL A 


What happens to the energy of the bonding state and the corresponding wave function 
when R —=> 0? 
We see from figure 3 that S. A and C approach. respectively. 1. 2E, and 2E, 
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when p —> 0. If we subtract the repulsion term. e?/R, of the two protons, to obtain the 
electronic energy. we find : 


= —3E (54) 


_ St ae 
R R70 I 


In addition, since | p, > approaches | p, >, |Y- > reduces to | p, > (the ground state Is of the 
hydrogen atom). 

This result is obviously incorrect. When R = 0, we have the equivalent* of a helium 
ion He*. The electronic energy of the ground state of H} must coincide. for R = 0. with that 
of the ground state of He*. Since the helium nucleus is a Z = 2 nucleus, this energy is 
(cf. complement Ay): 


— Z?E, = — 4E, (55) 


and not — 3£,. Furthermore, the wave function w_(r) should not approach 
pilt) = (nad) 12e7”%, but rather (na3/Z?) 20% with Z = 2 (the Bohr orbit is twice 
as small). This enables us to understand why the disagreement between the exact result and 
that of § 2 above becomes important for small values of R (fig. 2): this calculation uses atomic 
orbitals which are too spread out when the two protons are too close to each other. 

A possible improvement therefore consists of enlarging the family of trial kets because 
of these physical arguments and using kets of the form: 


|W > = c | @(Z)> + €, | @2(Z)> (56) 


where | p,(Z)> and | p,(Z) > are associated with 1s atomic orbitals of radius a)/Z centered 
at P, and P,. The ground state still corresponds, for reasons of symmetry, to c, = ¢). 
We consider Z like a variational parameter in seeking, for each value of R, the value of Z which 
minimizes the energy. 

The calculation can be performed completely in elliptic coordinates. We find (cf. fig. 5) 
that the optimal value of Z decreases from Z = 2 for R = 0 to Z = 1 for R —» 2. as it should. 

The curve obtained for AE _ is much closer to the exact curve (cf. fig. 2). Table I gives 
the values of the abscissa and ordinate of the minimum of AE_ obtained from the various 
models considered in this complement. It can be seen from this table that the energies found 
by the variational method are always greater than the exact energy of the ground state; in 
addition, we see that enlarging the family of trial kets improves the results for the energy. 


b. RESULTS FOR LARGE R 


When R —=> œ, we see from (48) that E, and E_ exponentially approach the same 
value — E,. Actually, this limit should not be obtained so rapidly. To see this, we shall use 
a perturbation approach, as in complement Cy, (Van der Waals forces) or Eyn (the Stark effect 
of the hydrogen atom). Let us evaluate the perturbation of the energy of a hydrogen atom (in 
the 1s state), situated at P,,produced by the presence of a proton P, situated at a distance R 
much greater than a)(p > 1). In the neighborhood of P,, the proton P, creates an electric 


* In addition to the two protons, the helium nucleus of course contains one or two neutrons. 
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FIGURE 5 


For each value of the internuclear distance, we have calculated the value of Z which minimizes the 
energy. For R = 0, we have the equivalent of a He* ion, and we indeed find Z = 2. For R > «po. 
we have essentially an isolated hydrogen atom, which gives Z = |. Between these two extremes, 
Z is a decreasing function of p. The corresponding optimal energies are represented by triangles 
in figure 2. 


| Equilibrium distance 
between the two protons | Depth of the minimum ; 
(the abscissa of AE _ 
of the minimum of 4£ ), 


Variational method of $ 2 (1s orbi- | i 
tals with Z = 1) .............. 2.50 dy | 1.76 eV | 


Variational method of $ 3-a(1s orbi- 
tals with variable Z)........... 2.00 a, | 2.35 eV 


Variational method of $ 3-b (hybrid ' 


orbitals with variable Z, Z’, 0). 2.00 a, | 2.73 eV 


| 
| , | 


| Exact values ..............o...... | 2.00 ao | 2.79 eV 


TABLE I 
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field E, which varies like 1/ R2. This field polarizes the hydrogen atom and causes an electric 
dipole moment D, proportional to E, to appear. The electronic wave function is distorted, and 
the barycenter of the electronic charge distribution moves closer to P, (fig. 6). E and D are 
both proportional to 1/R? and have the same sign. The electrostatic interaction between the 
proton P, and the atom situated at P, must therefore lower the energy by an amount which, 
like — E. D, varies like 1/R**. Consequently, the asymptotic behavior of AE, and 4£_ must 
vary, not exponentially, but like — a/R* (where a is a positive constant). 


PrO 


FIGURE 6 


Under the effect of the electric field E created by the proton P,, the electronic cloud of the hydrogen 
atom centered at P, becomes distorted, and this atom acquires an electric dipole moment D. An 
interaction energy results which decreases with 1/R* when R increases. 


It is actually possible to find this result by the variational method. Instead of linearly 
superposing 1s orbitals centered at P, and P,, we shall superpose hybrid orbitals y; and x3, 
which are not spherically symmetrical about P, and P,. 7, is obtained, for example, by linearly 
superposing a 1s orbital and a 2p orbital, both centered at P,**: 


Xalt) = pise) + a 93, lr) (57) 


and has a form analogous to the one shown in figure 6. Now, consider determinant (21). 
The non-diagonal elements H,, = < xı | H |x2> and S,, = <x, |Z2 > still approach zero 
exponentially when R —=> oo. This is because the product y, (r)x, (r) appears in the correspond- 
ing integrals; even though distorted, the orbitals x,(r) and y,(r) still remain localized in the 
neighborhoods of P, and P, respectively, and their overlap goes to zero exponentially when 
R —» œ.The two eigenvalues E, and E_ therefore both approach H,, = H,, when R —> œ, 
since determinant (21) becomes diagonal. 

Now, what does H,, represent? As we have seen (cf. $ 2-c), it is the energy of a hydrogen 
atom placed at P, and perturbed by the proton P, . The calculation of $ 2 neglected any polariza- 
tion of the 1s electronic orbital due to the effect of the electric field created by P,, and this is 
why we found an energy correction which decreased exponentially with R. However, if, as we 
are doing here, we take into account the polarization of the electronic orbital, we find a 
correction in — a/R*. The fact that, in (57), we consider only the mixing with the 2p orbital 
causes the value of a given by the variational calculation to be approximate (whereas the pertur- 
bation calculation of the polarization involves all the excited states, cf. Complement Eyn). 


* More precisely, the energy is lowered by — 4 E . D (cf. complement Exy, §1). 
** The symmetry axis of the 2p orbital is chosen along the straight line joining the two protons. 
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The two curves AE, and AE_ therefore do approach each other exponentially, since the 
difference between E, and E_ involves only the non-diagonal elements H,, and S,,, and 
their common value for large R approaches zero like — a/R* (fig. 7). 

The preceding discussion also suggests to us the use of polarized orbitals like the one 
in (57), not only for large R, but also for all other values of R as well. We would thus enlarge the 
family of trial kets and consequently improve the accuracy. In expression (57), we then 


FIGURE 7 


When p —> œ, the energies of 

the bonding and antibonding states 

approach each other exponentially. 

However, they approach their limiting 
AE value less rapidly (like 1/R*). 


AE, 


consider ø as a variation parameter, like the parameter Z which defines the Bohr radius a)/Z 
associated with the ls and 2p orbitals. To make the method even more flexible, we even choose 
different parameters Z and Z’ for @,, and @),. For each value of R, we then minimize the 
mean value of H in the state | y, > + | x2 > (which, for reasons of symmetry, is still the ground 
state), and we thus determine the optimal values of a, Z, Z’. The agreement with the exact 
solution then becomes excellent (cf. table T). 


4. Other molecular orbitals of the H; ion 


In the preceding sections, we constructed, by the variational method. 
a bonding molecular orbital and an antibonding molecular orbital from the ground 
state Is of each of the two hydrogen atoms which can be formed about the two 
protons. Of course, we chose the ls state because it was clear that this would be 
the best choice for obtaining an approximation of the ground state of the system of 
two protons and one electron. We can obviously envisage, with the method of 
linear combination of atomic orbitals (§2-a), using excited states of the hydrogen 
atom to obtain other molecular orbitals of higher energies. The main interest of 
these excited orbitals here will be to give us an idea of the phenomena which can 
come into play in molecules which are more complex than the Hj ion. For example, 
to understand the properties of a diatomic molecule containing several electrons, 
we can, in a first approximation, treat these electrons individually, as if they did not 
interact with each other. We thus determine the various possible stationary states for 
an isolated electron placed in the Coulomb field of the nuclei, and then place the 
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Tast 


electrons of the molecule in these states, taking the Pauli principle into account 
(chap. XIV, $ D-1) and filling the lowest energy states first (this procedure is analogous 
to the one described for many-electron atoms in complement Axy). In this section, we 
shall indicate the principal properties of the excited molecular orbitals of the Hž ion, 
while keeping in mind the possibilities of generalization to more complex molecules. 


a. SYMMETRIES AND QUANTUM NUMBERS. SPECTROSCOPIC NOTATION 


(i) The potential V created by the two protons is symmetric with respect to 
revolution about the P,P, axis, which we shall choose as the Oz axis. This means 
that V and, consequently, the Hamiltonian H of the electron, do. not depend on 
the angular variable gy which fixes the orientation about Oz of the MP,P, plane 
containing the Oz axis and the point M. It follows that H commutes with the 
component L, of the orbital angular momentum of the electron [in the { |r >} 


representation, L, becomes the differential operator ae which commutes with 
1 op 


any -independent operator]. We can then find a system of eigenstates of H which 
are also eigenstates of L,, and class them according to the eigenvalues mh of L.. 


(ii) The potential V is also invariant under reflection through any plane 
containing P,P,, that is, the Oz axis. Under such a reflection, an eigenstate of L, 
of eigenvalue mh is transformed into an eigenstate of L, of eigenvalue — mh (the 
reflection changes the sense of revolution of the electron about Oz). Because of the 
invariance of V, the energy of a stationary state depends only on |m]l. 

In spectroscopic notation, we label each molecular orbital with a Greek letter 


indicating the value of |m], as follows: 
Im] = 0 <> 0 
Im| = 1 <= z (58) 
[m| =2 <> ô 


(note the analogy with atomic spectroscopic notation : a, z, ô recall s, p, d). For 
example, since the ground state ls of the hydrogen atom has a zero orbital angular 
momentum, the two orbitals studied in the preceding sections are « orbitals 
(it can be shown that this is also true for the exact stationary wave functions, 
and not only for the approximate states obtained by the variational method). 

This notation does not use the fact that the two protons of the H} ion have 
equal charges. The o, z, ô classification of molecular orbitals therefore remains 
valid for a heteropolar diatomic molecule. 


(iii) In the HZ ion (and, more generally, in homopolar diatomic molecules), 
the potential V is invariant under reflection through the middle O of P,P,. We can 
therefore choose eigenfunctions of the Hamiltonian H in such a way that they have 
a definite parity with respect to the point O. For an even orbital, we add to the Greek 
letter which characterizes |m , an index g (from the German “gerade”); this index 
is u (“ungerade”) for odd orbitals. Thus, the bonding orbital obtained above from 
the ls atomic states is a o, orbital, while the corresponding antibonding orbital 
is G,- 
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(iv) Finally, we can use the invariance of H under reflection through the 
bisecting plane of P,P, to choose stationary wave functions which have a definite 
parity in this operation, that is, a parity defined with respect to the change in sign 
of the single variable z. Functions which are odd under this reflection are labeled 
with an asterisk. They are necessarily zero at all points of the bisecting plane 
of P,P,, like the orbital shown in figure 4-b; these are antibonding orbitals. 


COMMENT: 


Reflection through the bisecting plane of P,P, can be obtained by 
performing a reflection through O followed by a rotation of z about Oz. 
The parity (iv) is therefore not independent of the preceding symmetries 
(the “g” states will have an asterisk for odd |m| and none for even |m|: the 
situation is reversed for the “u` states). However, it is convenient to consider 
this parity, since it enables us to determine the antibonding orbitals 
immediately. 


b. MOLECULAR ORBITALS CONSTRUCTED FROM THE 2p ATOMIC ORBITALS 


If we start with the excited state 2s of the hydrogen atom, arguments analogous 
to those of the preceding sections will give a bonding o,(2s) orbital and an anti- 
bonding o*(2s) orbital, with forms similar to those in figure 4. We shall therefore 
concern ourselves instead with molecular orbitals obtained from the excited atomic 
states 2p. 


a.  Orbitals constructed from 2p, states 


We shall denote by | p3,, > and | p3,, > the atomic states 2p, (cf. comple- 
ment Ey, $ 2-b), centered at P, and P, respectively. The form of the corresponding 
orbitals is shown in figure 8 (note the choice of signs, indicated in the figure). 

By a variational calculation analogous to the one in $2, we can construct. 
starting with these two atomic states. two approximate eigenstates of the 
Hamiltonian (13). The symmetries recalled in $4-a imply that, to within a norma- 
lization factor, these molecular states can be written : 


| 2p, > + | 3. > (59-a) 
|p. > — | 03p, > (59-b) 


The shape of the two molecular orbitals so obtained can easily be deduced from 
ligure 8; they are shown in figure 9. 

The two atomic states 2p, are eigenstates of L, with the eigenvalue zero; 
the same is therefore also true of the two states (59). The molecular orbital asso- 
ciated with (59-a) is even and is written o,(2p,); the one corresponding to (59-b) 
is odd under a reflection through O as well as under a reflection through the 
bisecting plane of P,P,, and we shall therefore denote it by o*(2p,). 


B.  Orbitals constructed from 2p, or 2p, states 


We shall now start with the atomic states | p3, > and | O. >, with which 
are associated the real wave functions (cf. complement E,,,;, $ 2-b) shown in figure 10 
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(note that the surfaces of equal || whose cross sections in the xOz plane are given 
in figure 10 are surfaces of revolution, not about Oz, but about axes parallel to Ox 
and passing through P, and P,). Recall that the atomic orbital 2p, is obtained by 
the linear combination of eigenstates of L, corresponding to m = land m = — 1. 
The molecular orbitals constructed from these atomic orbitals therefore have |m| = 1; 


they are z orbitals. 


FIGURE 8 


Schematic representation of the 2p. atomic orbitals centered at P, and P, (the Oz axis is chosen 
along P,P,) and used as a basis for constructing the excited molecular orbitals 7,(2p_) and o%(2p,) 


shown in figure 9 (note the sign convention chosen). 


o,(2P,) an (2p,) 


Se el 
“ye 
ca 
we . 
Ae TT 
ve 
ATT 


FIGURE 9 


Schematic representation of the excited molecular orbitals : the bonding orbital c,(2p,) (fig. a) 
and the antibonding orbital o*(27,) (fig. b). As in figure 8, we have drawn the cross section in a plane 
containing P,P, of a constant modulus |y| surface. This is a surface of revolution about P,P}. 
The sign shown is that of the (real) wave function. The dashed-line curves are the cross sections 
in the plane of the figure of the nodal surfaces (|y| = 0). 


Here again, the approximate molecular states produced from the atomic 
states 2p, are the symmetrical and antisymmetrical linear combinations: 

| Pop. > + | Pp. > 

| Php. » 7 | Op, > 


(60-a) 
(60-b) 
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X 
C+) C+) 
P, P, 
O ee oe AU A A AA A A 
FIGURE 10 


Schematic representation of the atomic orbitals 2p, centered at P, and P, (the Oz axis is chosen 
along P,P) and used as a basis for constructing the excited molecular orbitals 7,(27,) and 1; (2p,) 
shown in figure 11. For each orbital, the surface of equal |y|, whose cross section in the xOz plane 
is shown, is a surface of revolution, no longer about Oz, but about a straight line parallel to Ox and 
passing either through P, or P). 


T (2P x) nlp.) 


JE 


FIGURE ll 


Schematic representation of the excited molecular orbitals: the bonding orbital 7,(2p,) (fig. a) 
and the antibonding orbital 7*(2p,) (fig. b). For each of these two orbitals, we have shown the cross 
section in the xOz plane of a surface on which |y| has a given constant value. This surface is no 
longer a surface of revolution but is simply symmetrical with respect to the xOz plane. The meaning 
of the signs and the dashed lines is the same as in figures 4, 8, 9, 10. 


The form of these molecular orbitals can easily be qualitatively deduced from 
figure 10. The surfaces of equal y| are not surfaces of revolution about Oz, but 
are simply symmetrical with respect to the xOz plane. Their cross sections in this 
plane are shown in figure 11. We see immediately in this figure that the orbital 
associated with state (60-a) is odd with respect to the middle O of P,P, but even 
with respect to the bisecting plane of P,P,; it will therefore be denoted by x,(2p..). 


1189 


www.elsolucionario.net 


COMPLEMENT Gy, 


On the other hand, the orbital corresponding to (60-b) is even with respect to 
point O and odd with respect to the bisecting plane of P,P, : it is an antibonding 
orbital, denoted by 2%(2p,). We stress the fact that these z orbitals have planes of 
symmetry, not axes of revolution like the o orbitals. 

Of course, the molecular orbitals produced by the atomic states 2p, can be 
deduced from the preceding ones by a rotation of 2/2 about P,P). 

n orbitals analogous to the preceding ones are involved in the double or 
triple bonds of atoms such as carbon (cf. complement E,,,, $8 3-c and 4-c). 


COMMENT: 


We saw earlier ($2-d) that the energy separation of the bonding and 
antibonding levels is due to the overlap of the atomic wave functions. Now, 
for the same distance R, the overlap of the Pip. and Qn. orbitals, which point 
towards each other, is larger than that of p3,_ and p3,_, whose axes are paral- 
lel (fig. 8 and 10). We see that the energy difference between a, (2p,) and of (2p,) 
is larger than that between 1,(2p,) and 13(2p,) [or 2,(2p,) and x*(2p,)]. The 
hierarchy of the corresponding levels is indicated in figure 12. 


ož 2p, 


ne 2p, n* 2p, 


2p7, 2Px, 2p, 


Ty 2p, Ty 2p, 


O, 2p, 
FIGURE 12 


The energies of the various excited molecular orbitals constructed from the atomic orbitals 2p,, 
2p, and 2p, centered at P, and P, (the Oz axis is chosen along P, P,). By symmetry, the molecular 
orbitals produced by the 2p, atomic orbitals are degenerate with those produced by the 2p, atomic 
orbitals. The difference between the bonding and antibonding molecular orbitals z,(2p,,) and 
T3 (2Px. y) is, however, smaller than the corresponding difference between the 9,(2p,) and o*(2p,) 
molecular orbitals. This is due to the larger overlap of the two 2p, atomic orbitals. 
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5. The origin of the chemical bond ; the virial theorem 
a. STATEMENT OF THE PROBLEM 


When the distance R between the protons decreases, their electrostatic 
repulsion e?/R increases. The fact that the total energy E_(R) of the bonding state 
decreases (when R decreases from a very large value) and then passes through 
a minimum therefore means that the electronic energy begins by decreasing faster 
than e?/R increases (of course, since this term diverges when R —= 0, it is the 
repulsion between the protons which counts at short distances). We can then ask 
the following question : does the lowering of the electronic energy which makes 
the chemical bond possible arise from a lowering of the electronic potential energy 
or from a lowering of the kinetic energy or from both ? 

We have already calculated, in (52) and (53), approximate expressions for 
the (total) potential and kinetic energies. We might then consider studying the 
variation of these expressions with respect to R. Such a method, however, would 
have to be used with caution, since, as we have already pointed out, the eigen- 
functions supplied by a variational calculation are much less precise than the 
energies. We shall discuss this point in greater detail in §5-d-B below. 

Actually, it is possible to answer this question rigorously, thanks to the 
“virial theorem”, which provides exact relations between E(R) and the average 
kinetic and potential energies. Therefore, in this section, we shall prove this 
theorem and discuss its physical consequences. The results obtained, furthermore, 
are completely general and can be applied, not only to the molecular ion HZ, but 
also to all other molecules. Before considering the virial theorem itself, we shall 
begin by establishing some results which we shall need later. 


b. SOME USEFUL THEOREMS 


A. Euler’s theorem 


Recall that a function f (x,, x5. ... x,) of several variables x,, X3, ... Xx, is said 
to be homogeneous of degree s if it is multiplied by 4* when all the variables are 
multiplied by A: 


TAX A ee AX, ) = AS (Xi, Xas oe Xp) (61) 


For example. the potential of a three-dimensional harmonic oscillator : 


1 
V(x, y, z) = 5 mw?(x? + y? +2?) (62) 


is homogeneous of degree 2. Similarly, the electrostatic interaction energy of two 
particles: 
ese» =R Cal; (63) 
la HE, Y 


is homogeneous of degree — 1. 
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Euler’s theorem indicates that any function f which is homogeneous of degree s 
satisfies the identity : 


2 Xia = BT UX pee Xis cas Xp) (64) 


To prove this, we calculate the derivatives with respect to 4 of both sides of (61). The left-hand 
side yields : 


af ô a 
F x (Axi, s Aq) x (Xi) = Eo, (ey Ax,) (65) 


and the right-hand side yields: 
SATE f (Xi, s Xn) (66) 
If we set (65) equal to (66), with 4 = 1, we obtain (64). 


Euler’s theorem can very easily be verified in examples (62) and (63). 


Pp. The Hellman-Feynman theorem 


Let H(A) be a Hermitian operator which depends on a real parameter å, 
and | y(4) >, a normalized eigenvector of H(A) of eigenvalue E(A): 


H(A) |Y) > = EA) |yw()> (67) 
< yl) |y) = 1 (68) 


The Hellmann-Feynman theorem indicates that : 


d d 
$ BUA) = <A HA) YA) > (69) 


This relation can be proven as follows. According to (67) and (68), we have: 
E(4) = < YA) | HA) | yA) > (70) 
If we differentiate this relation with respect to 4, we obtain: 


d d 
q El) = < WA) 1 A) | WA) > 


d 
IU o 
that is, using (67) and the adjoint relation [H(A) is Hermitian, hence E(A) is real]: 


d d 
JEA = < WA) | 7 AA) | vA) > 


+ EQ) E wajia tawaia] } (72 
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On the right-hand side, the expression inside curly brackets is the derivative 
of < W(A) | W(A) >, which is zero since | w(2) > is normalized ; we therefore find (69). 
y. Mean value of the commutator (H, A] in an eigenstate of H 


Let | yw > be a normalized eigenvector of the Hermitian operator H, of 
eigenvalue E. For any operator A: 


<W|[H, A]|y> =0 (73) 
since, as H| y» > = Ely >»and<y | H = EXyl: 

< y | (HA — AM) [y >) = EX Y lAl > - EXy|4[y>=0 (74) 
c. THE VIRIAL THEOREM APPLIED TO MOLECULES 


a. The potential energy of the system 


Consider an arbitrary molecule composed of N nuclei and Q electrons. 
We shail denote by rf(k = 1, 2, ..., N) the classical positions of the nuclei, and 
by rf and pf (i = 1, 2, ..., Q) the classical positions and momenta of the electrons. 
The components of these vectors will be written xf, yz, z;, etc. 

We shall use the Born-Oppenheimer approximation here, considering the r? 
as given classical parameters. In the quantum mechanical calculation, only 
the r; and pf become operators, R? and PF. We must therefore solve the eigenvalue 
equation : 


He", at) |W (t, st) > = Et, wt) | et, ut) > (75) 


of a Hamiltonian H which depends on the parameters r;, ..., ry and which acts in 
the state space of the electrons. The expression for H can be written: 


H = T, + Vir, rA) (76) 


where T, is the kinetic energy operator of the electrons: 


2 1 
= — (P+*y? 7 
Te = ¥ gt (77) 
and V(r}, ..., ry) is the operator obtained by replacing the rf by the operators R$ in 


the expression for the classical potential energy. The latter is the sum of the 
repulsion energy V,, between the electrons, the attraction energy V,, between the 
electrons and the nuclei, and the repulsion energy Y, between the nuclei, so that : 


VOC so BN) = Voe + Vent ao EN) + Valtis o rN) (78) 


Actually, since V,, depends only on the r{ and does not involve the RS, 
y, is a number and not an operator acting in the state space of the electrons. 
The only effect of V, is therefore to shift all the energies equally, since equation (75) 
is equivalent to: 


1193 


www.elsolucionario.net 
COMPLEMENT Gx; 
Hr), ... th) | Wr, Th) > = Ed. th) | MET, Pk) > (79) 
where: 
Fle ost) = Tye + Veg + Vol, PR) = H— Vil, BK) (80) 
and where the electronic energy E, is related to the total energy E by: 
EN eet) = Elis. rA) — VI E) (81) 


We can apply Euler’s theorem to the classical potential energy, since it is 
a homogeneous function of degree — 1 of the set of electronic and nuclear 
coordinates. Since the operators R$ all commute with each other, we can find the 
relation between the quantum mechanical operators : 


N Q 
ES + 2 R VV = —V (82) 


where V} and V? denote the operators obtained by substitution of the Rf for 
the rf in the gradients with respect to rf and rf in the classical expression for the 
potential energy. Relation (82) will serve as the foundation of our proof of the virial 
theorem. 


B. Proof of the virial theorem 
We apply (73) to the special case in which: 
Q 
A= Y R.P? (83) 
i=1 


To do so, we find the commutator of H with 4: 


Q Q 
ESCRITOR 


i= i=1 x,y,z 


- EX Priv } (84) 


| 
S 
Mo 


i=l 


(we have used the commutation relations of a function of the momentum with 
the position, or vice versa ; cf. complement B,,. $4-c). The first term inside the curly 
brackets is proportional to the kinetic energy T,. According to (82), the second 
term is equal to: 


N 
—~V— Y ri Viv (85) 
k=1 
Consequently, relation (73) gives us: 
N 
2¢7,> CV > +3 Ec Viv > = 0 (86) 
k=1 


that is, since the Hamiltonian H depends on the parameters r? only through V: 


N 
A KM = LV (87) 
k= 
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The components rg here play a role analogous to that of the parameter 4 in (69). 
Application of the Hellmann-Feynman theorem to the right-hand side of equa- 
tion (87) then gives: 


N 
20 Ty POV eS — YY rg. WREG. stp nry) (88) 


k=1 
Furthermore, we obviously have: 
<T> FLV) = Ett, rN) (89) 
We can then easily find from (88) and (89): 


N 
(T,)=-E- ErpoviE 
k=1 
N 
<V>=2E + Y rf. Vie | (90) 
k=1 | 


Thus, we obtain a very simple result: the virial theorem applied to molecules. 
It enables us to calculate the average kinetic and potential energies if we know 
the variation of the total energy with respect to the positions of the nuclei. 


COMMENT : 


The total electronic energy E, and the electronic potential energy < V, > 
are also related by: 


. . 
<V, =2E, + y Fr. Vi Es (91) 
k=1 | 


This relation can be proven by substituting (81) and the explicit expression for V,,, in 
terms of the rj into the second relation of (90). However, it is simpler to note that the 
electronic potential energy V, = V,, + V, like the total potential energy V. is a homo- 
geneous function of degree — | of the coordinates of the system of particles. 
Consequently. the preceding arguments apply to H, as well as to H, and we can simul- 
taneously replace E by E, and V by V, in both relations (90). 


Y. A special case: the diatomic molecule 


When the number N of nuclei is equal to two, the energies depend only on the 
internuclear distance R. This further simplifies the expression for the virial theorem, 
which becomes: 


dE 
(T.> = —E-RaR 
pa (92) 
| <V> =2E ER 


| 
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Since E depends on the nuclear coordinates only through R, we have: 


ðE dE R 
— = —— (93) 
Ox, dR Ox; 
and, consequently : 
COE dE OR 
L 24 = LA (94) 


k=1,2 x,y,z Ox, dR k=1,2 x,y,z Ox, 


Now, the distance R between the nuclei is a homogeneous function of degree 1 of the coordinates 
of the nuclei. Application of Euler’s theorem to this function enables us to replace the double 
summation appearing on the right-hand side of (94) by R, and we finally obtain: 


dE 
" VIE = R— 95 
x k dR ( ) 


When this result is substituted into (90), it gives relations (92). 
In (92) as in (90), we can replace E by E, and V by V.. 


d. DISCUSSION 


o. The chemical bond is due to a lowering of the electronic potential energy 


Let E,, be the value of the total energy E of the system when the various 
nuclei are infinitely far apart. If it is possible to form a stable molecule by moving 
the nuclei closer together, there must exist a certain relative arrangement of these 
nuclei for which the total energy E passes through a minimum E, < E,,. For the 
corresponding values of rg, we then have: 


V'E =0 (96) 


Relations (90) then indicate that, for this equilibrium position, the kinetic and 
potential energies are equal to: 


< T, Yo = Eo 
< 4 o = 2E (97) 
Furthermore, when the nuclei are infinitely far from each other, the system is 


composed of a certain number of atoms or ions without mutual interactions (the 
energy no longer depends on the rý). For each of these subsystems, the virial 


theorem indicates that < T, > = — E, < V > = 2E, and, for the system as a whole, 
we must therefore also have: 

lT» == Es 

(Voy, =2E, (98) 
Subtracting (98) from (97) then gives: 

(Edo — < Ede = — (Eo — Ex) > 0 

(V0 = <V» = 2Eo — E) < 0 (99) 


The formation of a stable molecule is therefore always accompanied by an increase 
in the kinetic energy of the electrons and a decrease in the total potential energy. 
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The electronic potential energy must, furthermore, decrease even more since the 
mean value < V,, > (the repulsion between the nuclei), which is zero at infinity, is 
always positive. It is therefore a lowering of the potential energy of the elec- 
trons < Ve + Van > that is responsible for the chemical bond. At equilibrium, this 
lowering must outweigh the increase in < T, > and < V,, >. 


B. The special case of the H} ion 


(i) Application of the virial theorem to the approximate variational energy. 

We return to the study of the variation of < T, > and < V > for the H} ion. We shall 
begin by examining the predictions of the variational model of $ 2, which led to the approximate 
expressions (52) and (53). From the second of these relations, we deduce that: 


aT, = <1, ) kdd = 


ag (A - 28 E) (100) 


FIGURE 13 


The electronic kinetic energy < 7; > and the potential energy < V > of the HF ion as functions of 
p = R/a, (for purposes of comparison, we have also shown the total energy E = <T> + < V5). 
. solid fines: the exact values (the chemical bond is due to the fact that < V > decreases a little 
faster than < T, > increases). 

. long dashes: the mean values calculated from the bonding wave function given by the simple 
variational method of $ 2. 

. Short dashes: the values obtained by the application of the virial theorem to the energy given by 
the same variational calculation. 
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Since S is always greater than A/2E, (cf. fig. 3), this calculation would tend to indicate that AT, 
is always negative. This appears, moreover, in figure 13. where the dashed lines represent the 
variations of the approximate expressions (52) and (53). In particular, we see that, according to 
the variational calculation, AZ is negative at equilibrium (p ~ 2.5) and AV is positive. These 
results are both incorrect, according to (99). We see here the limits of a variational calculation, 
which gives an acceptable value for the total energy < J + V >, but not for < J > and < V> 
separately. The mean values of the latter depend too strongly on the wave function. 

The virial theorem enables us. without having to resort to the rigorous calculation 
mentioned in $1-c, to obtain a much better approximation for < T, > and < V >. All we need 
to do is apply the exact relations (92) to the energy E calculated by the variational method. 
We should expect to obtain an acceptable result, since the variational approximation is now used 
only to supply the total energy E. The values thus obtained for < T, > and < V > are represented 
by short dashed lines in figure 13. For purposes of comparison, we have shown in solid lines the 
exact values of < T, > and < V > (obtained by application of the virial theorem to the solid-line 
curve of figure 2). First of all, we see that for p = 2.5, the curve in short dashed lines indicates, 
as we should expect, that AT, is positive and AV is negative. In addition, the general shape 
of these curves reproduces rather well that of the solid-line curves. As long as p > 1.5, the 
virial theorem applied to the variational energy does give values which are very close to reality. 
This represents a considerable improvement over the direct calculation of the mean values in 
the approximate states. 


(if) Behavior of < T> and < V > 


The solid-line curves of figure 13 (the exact curves) show that < T, > —> 4E, and 
< V > —> + œ when R —> 0. Indeed, when R = 0; we have the equivalent of a He” ion 
for which the electronic kinetic energy is 4£,. The divergence of < V > is due to the term 
< Va > = e?/R, which becomes infinite when R —> 0 (the electronic potential energy 
<V,> =< V)> — e?/R remains finite and approaches — 8£,, which is indeed its value in 
the He* ion). 

The behavior for large R deserves a more detailed discussion. We have seen above (§ 3-b) 
that the energy E_ of the ground state behaves, for R > ap. like: 


A A (101) 
R4 


where a is a constant which is proportional to the polarizability of the hydrogen atom. 
By substituting this result into formulas (92), we obtain: 


CV) = — 28, + (102) 


When R decreases from a very large value, < T, > begins by decreasing with 1/R* from its asymp- 
totic value E,, and < V > begins by increasing from — 2£,. These variations then change sign 
(this must be so since < T >o is larger than < J >,, and < V >, is smaller than < V>,): as R 
continues to decrease (cf. fig. 13), < T, > passes through a minimum and then increases until 
it reaches its value 4E, for R = 0. As for the potential energy < V >, it passes through a maximum, 
then decreases, passes through a minimum, and then approaches infinity when R —> 0. How 
can we interpret these variations? 

As we have noted several times, the non-diagonal elements A, , and H,, of determinant (21) 
approach zero exponentially when R —=> oo. We can therefore argue only in terms of H,, 
or H,, in discussing the variation of the energy of the H} ion at large internuclear distances. 
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The problem is then reduced to the study of the perturbation of a hydrogen atom centered 
at P, by the electric field of the proton P,. This field tends to distort the electronic orbital by 
stretching it in the P, direction (cf. fig. 6). Consequently, the wave function extends into 
a larger volume. According to Heisenberg's uncertainty relations, this allows the kinetic energy 
to decrease ; this can explain the behavior of < T, > for large R. 

Arguing in terms of A,,, we can also explain the asymptotic behavior of < V >. The dis- 
cussion of §3-b showed that, for R > ap. the polarization of the hydrogen atom situated at P, 


e e 
makes its interaction energy < — FE + A > with P, slightly negative (proportional to — 1/R*). 
1 2 
: e : 
If < V > is positive, it is because the potential energy < — — > of the atom at P, increases 
; r, 


¿2 2 
more rapidly, when P, is brought closer to P,, than < — Ly = > decreases. This increase 
2 r 
in < — = > is due to the fact that the attraction of P, moves the electron slightly away from P, 

2 
and carries it into regions of space in which the potential created by P, is less negative. 

For R = R, (the equilibrium position of the HZ ion), the wave function 
of the bonding state is highly localized in the region between the two protons. 
The decrease in < V > (despite the increase in e?/R) is due to the fact that the 
electron is in a region of space in which it benefits simultaneously from the attraction 
of both protons. This lowers its potential energy (cf. fig. 14). This combined 
attraction of the two protons also leads to a decrease in the spatial extension of 
the electronic wave function, which is concentrated in the intermediate region. 
This is why, for R close to Rọ, < T, > increases when R decreases. 


FIGURE 14 
Variation of the potential energy V, of the electron subjected to the simultaneous attraction of the 
two protons P, and P, as one moves along the line P,P,. In the bonding state, the wave function 
is concentrated in the region between P, and P,, and the electron benefits simultaneously from the 
attraction of both protons. 


References and suggestions for further reading (H ion, H, molecule, nature of the 
chemical bond, etc.) : 

Pauling (12.2); Pauling and Wilson (1.9), chaps. XII and XIII; Levine (12.3), 
chaps. 13 and 14; Karplus and Porter (12.1), chap. 5, § 6; Slater (1.6), chaps. 8 and 9; 
Eyring et al (12.5), chaps. XI and XII; Coulson (12.6), chap. IV; Wahl (12.13). 
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Complement Hy, 


EXERCISES 

1. A particle of mass m is placed in an infinite one-dimensional well of width a: 
V(x) = 0 for O<x<a 
Víx) = + œ everywhere else 


It is subject to a perturbation W of the form: 


W(x) = any (x =- 5) 


where w, is a real constant with the dimensions of an energy. 
a. Calculate, to first order in wọ, the modifications induced by W(x) in the 
energy levels of the particle. 


b. Actually, the problem is exactly soluble. Setting k = y 2mE/h?, show that 
the possible values of the energy are given by one of the two equations sin (ka/2) = 0 
or tan (ka/2) = — h*k/maw, (as in exercise 2 of complement L,, watch out for the 
discontinuity of the derivative of the wave function at x = a/2). 

Discuss the results obtained with respect to the sign and size of wọ. In the 
limit wọ —> 0, show that one obtains the results of the preceding question. 


2. Consider a particle of mass m placed in an infinite two-dimensional potential 
well of width a (cf. complement G,,): 


V(x, y) = 0 if O<x<a and 0<y<a 
V(x, y) = + oo everywhere else 


This particle is also subject to a perturbation W described by the potential: 
W(x, y}= wo for 0<x< 


W(x, y) =0 everywhere else 


a. Calculate, to first order in w,, the perturbed energy of the ground state. 
b. Same question for the first excited state. Give the corresponding wave 
functions to zeroeth order in wọ. 
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3. A particle of mass m, constrained to move in the xOy plane, has a Hamiltonian: 


Po P ç 1 
=L +i +i mox? 2 
Ho zm tag Tamo xX + Yo) 


(a two-dimensional harmonic oscillator, of angular frequency œ). We want to study 
the effect on this particle of a perturbation W given by: 


where À; and A, are constants, and the expressions for M and W, are: 


W, = mW’ XY 
2 
mol — 2) 
h? 


(L, is the component along Oz of the orbital angular momentum of the particle). 
In the perturbation calculations, consider only the corrections to first order 
for the energies and to zeroeth order for the state vectors. 


W, 


a. Indicate without calculations the eigenvalues of H,, their degrees of 
degeneracy and the associated eigenvectors. 
In what follows, consider only the second excited state of Ho, of energy 3hw 
and which is three-fold degenerate. 


b. Calculate the matrices representing the restrictions of W, and W, to the 
eigensubspace of the eigenvalue 3hm of Ho. 


c. Assume 4, = Oand 4, < l. 
Calculate, using perturbation theory, the effect of the term 4, W, on the second 
excited state of Ho. 


d. Compare the results obtained in c with the limited expansion of the exact 
solution, to be found with the help of the methods described in complement Hy 
(normal vibrational modes of two coupled harmonic oscillators). 


e. Assume 4, < A, < 1. Considering the results of question c to be a new 
unperturbed situation, calculate the effect of the term 4, W. 


f. Now assume that 4, = O and 4, < I. 
Using perturbation theory, find the effect of the term 4,W, on the second 
excited state of Ho. 


g. Compare the results obtained in f with the exact solution, which can be 
found from the discussions of complement D,,. 


h. Finally, assume that 4, < 4, « 1. Considering the results of question f 
to be a new unperturbed situation, calculate the effect of the term 4, W. 
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4. Consider a particle P of mass u, constrained to move in the xOy plane in 
a circle centered at O with fixed radius p (a two-dimensional rotator). The only 
variable of the system is the angle a = (Ox, OP), and the quantum state of the 
particle is defined by the wave function y(x) (which represents the probability 
amplitude of finding the particle at the point of the circle fixed by the angle a). 
At each point of the circle, w(a) can take on only one value, so that: 


yla + 27) = yla) 


Y (0) is normalized if: 


hd | 
i da’ 
values and normalized eigenfunctions of M. What is the physical meaning of M? 


a. Consider the operator M = s M Hermitian? Calculate the eigen- 


b. The kinetic energy of the particle can be written: 


2up? 


Ho 


Calculate the eigenvalues and eigenfunctions of H,. Are the energies degenerate? 


c. Att = 0, the wave function of the particle is N cos? a (where N is a normal- 
ization coefficient). Discuss the localization of the particle on the circle at a 
subsequent time /. 


d. Assume that the particle has a charge y and that it interacts with a uniform 
electric field 6 parallel to Ox. We must therefore add to the Hamiltonian H, the 
perturbation: 


W = — gé pcos a 


Calculate the new wave function of the ground state to first order in &. Deter- 
mine the proportionality coefficient y (the linear suceptibility) between the electric 
dipole parallel to Ox acquired by the particle and the field @. 


e. Consider, for the ethane molecule CH, —CH,, a rotation of one CH, group 
relative to the other about the straight line joining the two carbon atoms. 

To a first approximation, this rotation is free, and the Hamiltonian H, intro- 
duced in b describes the rotational kinetic energy of one of the CH, groups relative 
to the other (2up* must, however, be replaced by A/. where / is the moment of inertia 
of the CH, group with respect to the rotational axis and 4 is a constant). To take 
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account of the electrostatic interaction energy between the two CH, groups, we add 
to A, a term of the form: 


W = bcos 3x 


where b is a real constant. 

Give a physical justification for the a-dependence of W. Calculate the energy 
and wave function of the new ground state (to first order in b for the wave function 
and to second order for the energy). Give a physical interpretation of the result. 


5. Consider a system of angular momentum J. We confine ourselves in this 
exercise to a three-dimensional subspace, spanned by the three kets | + 1 >, |0 >, 
| — 1 >, common eigenstates of J? (eigenvalue 2h?) and J, (eigenvalues + fi, 0, — A). 
The Hamiltonian Ho of the system is: 


where a and b are two positive constants, which have the dimensions of an angular 
frequency. 


a. What are the energy levels of the system? For what value of the racio b/a 
is there degeneracy ? 


b. A static field B, is applied in a direction u with polar angles 0 and gy. The 
interaction with B, of the magnetic moment of the system: 


M = yJ 

(y : the gyromagnetic ratio, assumed to be negative) is described by the Hamiltonian: 
W = 00, 

where œ = — 7 |Bo| is the Larmor angular frequency in the field B,, and J, is the 


component of J in the u direction: 
J, = J cos O + J, sin @ cos g + J, sin 0 sin o 
Write the matrix which represents W in the basis of the three eigenstates of Hp. 


c. Assume that b = a and that the u direction is parallel to Ox. We also have 
Wy < a. 

Calculate the energies and eigenstates of the system, to first order in wọ for the 
energies and to zeroeth order for the eigenstates. 


d. Assume that b = 2a and that we again have wọ < a, the direction of u 
now being arbitrary. 

In the {| + 1), [0 >, | — l> } basis, what is the expansion of the ground 
state | Wo > of Hy + W, to first order in wg? 
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Calculate the mean value < M > of the magnetic moment M of the system in 
the state | Yo >. Are < M > and B, parallel? 
Show that one can write: 


<M;> = È Xy B; 
J 


with i,j = x, y, z. Calculate the coefficients y, (the components of the susceptibility 
tensor). 


6. Consider a system formed by an electron spin S and two nuclear spins I, 
and I, (S is, for example, the spin of the unpaired electron of a paramagnetic 
diatomic molecule, and I, and I, are the spins of the two nuclei of this molecule). 

Assume that S. I. I, are all spin 1/2’s. The state space of the three-spin system 
is spanned by the eight orthonormal kets |gs. £,,2, >, common eigenvectors 
of S., Ziz /,,, with respective eigenvalues esh/2, ¢,h/2, ¢,h/2 (with es = +,€, = +. 
£, = +). For example, the ket | +, —. + > corresponds to the eigenvalues + h/2 
for S,, — h/2 for Z.. and + h/2 for /,.. 


a. We begin by neglecting any coupling of the three spins. We assume, however, 
that they are placed in a uniform magnetic field B parallel to Oz. Since the gyro- 
magnetic ratios of I, and I, are equal, the Hamiltonian H, of the system can be 
written : 


H = QS, + oli- + ol, 


where Q and w are real, positive constants, proportional to |B|. Assume Q > 2w. 
What are the possible energies of the three-spin system and their degrees of 
degeneracy ? Draw the energy diagram. 


b. We now take coupling of the spins into account by adding the Hamiltonian: 
W=aS.I, +a8S.!, 


where a is a real, positive constant (the direct coupling of E, and I, is negligible). 

What conditions must be satisfied by es, €,. 65. es. £l. £3 for aS. I, to have 
a non-zero matrix element between | es. £1. £2 > and | ez. £1. £3 >? Same question for 
aS.L. 


c. Assume that: 
ah? < hQ, hw 


so that W can be treated like a perturbation with respect to Ho. To first order in W, 
what are the eigenvalues of the total Hamiltonian H = H, + W? To zeroeth order 
in W, what are the eigenstates of A? Draw the energy diagram. 
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d. Using the approximation of the preceding question, determine the Bohr 
frequencies which can appear in the evolution of < S, > when the coupling W of 
the spins is taken into account. 

Inan E.P.R. (Electronic Paramagnetic Resonance) experiment, the frequencies 
of the resonance lines observed are equal to the preceding Bohr frequencies. What is 
the shape of the E.P.R. spectrum observed for the three-spin system? How can the 
coupling constant « be determined from this spectrum ? 


e. Now assume that the magnetic field B is zero, so that Q = œw = 0. The 
Hamiltonian then reduces to W. 


x. Let I = 1, + I, be the total nuclear spin. What are the eigenvalues of I? 
and their degrees of degeneracy? Show that W has no matrix elements between 
eigenstates of 1? of different eigenvalues. 


B. Let J = S + I be the total spin. What are the eigenvalues of J? and their 
degrees of degeneracy? Determine the energy eigenvalues of the three-spin 
system and their degrees of degeneracy. Does the set { J*. J, } form a C.S.C.O.? 


Same question for (12, J?. J. }. 


7. Consider a nucleus of spin / = 3/2, whose state space is spanned by the four 
vectors |m) (m = + 3/2, + 1/2. — 1,2. — 3/2), common eigenvectors of I? 
(eigenvalue 15?/4) and 7, (eigenvalue ih). 

This nucleus is placed at the coordinate origin in a non-uniform electric field 
derived from a potential U(x, y, z). The directions of the axes are chosen such that, 
at the origin: 


Recall that U satisfies Laplace’s equation: 
AU =0 


We shall assume that the interaction Hamiltonian between the electric field 
gradient at the origin and the electric quadrupole moment of the nucleus can be 
written : 


Hy =—#2 4 [uP + ay + 0,82] 
2121 — Ihe Á 


where g is the electron charge, O is a constant with the dimensions of a surface and 
proportional to the quadrupole moment of the nucleus, and: 


ae ear AN a) 
CL cul cu 
d, = > a, = Li a= 
À As? y Aa 5 2 
Ox" So Cy" o CZ Zo 


(the index 0 indicates that the derivatives are evaluated at the origin). 
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a. Show that, if U is symimetrical with respect to revolution about Oz, Ho has 
the form: 


Ho = ABE — WE + 1952] 


where A is a constant to be specified. What are the eigenvalues of //,. their degrees 
of degeneracy and the corresponding eigenstates ? 


b. Show that, in the general case, H, can be written: 
H, = ABE — Wi + 1)A?] + BUA +1) 


where A and B are constants, te be expressed in terms of a, and 4, 

What is the matrix which represents H, in the { |m > } basis? Show that it 
can be broken down into two 2 x 2 submatrices. Determine the eigenvalues of Ho 
and their degrees of degeneracy, as well as the corresponding eigenstates. 


c. In addition to its quadrupole moment, the nucleus has a magnetic moment 
M = yl (y: the gyromagnetic ratio). Onto the electrostatic field is superposed 
a magnetic field Bo, of arbitrary direction u. We set wọ = — y |Bol. 

What term W must be added to H, in order to take into account the coupling 
between M and B,? Calculate the energies of the system to first order in Bo. 


d. Assume B, to be parallel to Oz and weak enough for the eigenstates found 
in b and ihe energies to first order in wọ found in ¢ to be good approximations. 

What are the Bohr frequencies which can appear in the evolution of </>? 
Deduce from them the shape of the nuclear magnetic resonance spectrum which 
can be observed with a radiofrequency field oscillating along Ow. 


8. A particle of mass 77 is placed in an infinite one-dimensional potential well 
of width a: 


V(x) = 0 forO <A <a 
) = +æ elsewhere 
Assume that this particle, of charge — q, is subject to a uniform electric field 4. 
with the corresponding perturbation W being: 


a 


a. Let e, and e, be the corrections to first- and second-order in 4 for the 
ground state energy. 

Show that e, is zero. Give the expression for e, in the form of a series, whose 
terms are to be calculated in terms of q, 6, m, a, h (the integrals given at the end of 
the exercise can be used). 
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b. By finding upper bounds for the terms of the series for ¢,, give an upper 
bound for e, (ef. §$B-2-c of chapter XI). Similarly, give a lower bound e), 
obtained by retaining only the principal term of the series. 

With what accuracy do the two preceding bounds enable us to bracket the 
exact value of the shift AE in the ground state to second order in £? 


c. We now want to calculate the shift 4£ by using the variational method. 
Choose as a trial function: 


w(x) = E sin (=) [ 4 296» T s)| 


where a is the variational parameter. Explain this choice of trial functions. 

Calculate the mean energy < H >(a) of the ground state to second order in & 
[assuming the expansion of < H > («)to second order in & to be sufficient |. Determine 
the optimal value of x. Find the result AE „a given by the variational method for the 
shift in the ground state to second order in £. 

By comparing A£,,, with the results of b, evaluate the accuracy of the 
variational method applied to this example. 

We give the integrals: 


2 { AN. (=) : (2 léna | 
= x — —) sin (— | sin dx = — EE 
a Jo 5) a a n> (1 — 4n?)? 


For all the numerical calculations, take 2? = 9.87. 


9. We want to calculate the ground state energy of the hydrogen atom by the 
variational method, choosing as trial functions the spherically symmetrical 
functions p,(r) whose r-dependence ts given by : 


pAr) = c(i — r) forr <a 
(r) = 0 forr > a 


C is a normalization constant and a is the variational parameter. 


a. Calculate the mean value of the kinetic and potential energies of the 
electron in the state | p, >. Express the mean value of the kinetic energy in terms 
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of Vip, so as to avoid the “delta functions” which appear in 4p (since Vo is 
discontinuous). 
b. Find the optimal value a, of «. Compare «, with the Bohr radius «o. 


c. Compare the approximate value obtained for the ground state energy with 
the exact value — E). 


10. We intend to apply the variational method to the determination of the energies 
of a particle of mass m in an infinite potential well: 

V(x) = 0 —-agxea 

V(x) = œ everywhere else 

a. We begin by approximating, in the interval [— a, + a], the wave function 
of the ground state by the simplest even polynomial which goes to zero atx = + a: 

w(x) = a? — x? for -—a<x <a 

w(x) = 0 everywhere else 
(a variational family reduced to a single trial function). 

Calculate the mean value of the Hamiltonian A in this state. Compare the 
result obtained with the true value. 


b. Enlarge the family of trial functions by choosing an even fourth-degree 
polynomial which goes to zero at x = + a: 


wiAx) = (a? — x? ja? —ax*) for —as<x<u 
W(x) = 0 everywhere else 
(a variational family depending on the real parameter «). 
(x) Show that the mean value of H in the state y(x) is: 
h? 330? — 42a + 105 
2ma* 2a? — 12% + 42 


< H (a) = 


(B) Show that the values of x which minimize or maximize < H > (x) are 
given by the roots of the equation: 


130? — 98a + 21 = 0 


(y) Show that one of the roots of this equation gives, when substituted 
into < H (a). a value of the ground state energy which is much more precise than 
the one obtained in a. 

(ò) What other eigenvalue is approximated when the second root of the 
equation obtained in 4-B is used? Could this have been expected? Evaluate the 
precision of this determination. 

c. Explain why the simplest polynomial which permits the approximation 
of the first excited state wave function is x(a? — x7). 
What approximate value is then obtained for the energy of this state? 
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2. Magnetic interactions related to proton spin : the hyperfine 
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C. THE FINE STRUCTURE 1. Statement of the problem 
OF THE n= 2 LEVEL a. Degeneracy of the n = 2 level 
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a. Eigenstates and eigenvalues of the Zeeman term 
b. Effects of the hyperfine term considered as a perturbation 
c. The Bohr frequencies involved in the evolution of < S, > 
4. The intermediate-field Zeeman effect 
a. The matrix which represents the total perturbation in the 
{| F. mp > } basis 
b. Energy values in an arbitrary field 
c. Partial hyperfine decoupling 
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CHAPTER XII THE FINE AND HYPERFINE STRUCTURE OF THE HYDROGEN ATOM 


A. INTRODUCTION 


The most important forces inside atoms are Coulomb electrostatic forces. We took 
them into account in chapter VII by choosing as the hydrogen atom Hamiltonian: 


p? 


de 


+ V(R) (A-1) 


The first term represents the kinetic energy of the atom in the center of mass frame 
(u is the reduced mass). The second term: 


V(R) = q l os e (A-2) 


represents the electrostatic interaction energy between the electron and the proton 
(y is the electron charge). In $C of chapter VII, we calculated in detail the 
eigenstates and eigenvalues of Hy. 

Actually, expression (A-1) is only approximate: it does not take any 
relativistic effects into account. In particular, all the magnetic effects related to the 
electron spin are ignored. Moreover, we have not introduced the proton spin 
and the corresponding magnetic interactions. The error is, in reality, very small, 
since the hydrogen atom is a weakly relativistic system (recall that, in the Bohr 
model, the velocity v in the first orbit n = 1 satisfies v/c = e*/hc = 1/137 < 1). 
In addition, the magnetic moment of the proton is very small. 

However, the considerable accuracy of spectroscopic experiments makes it 
possible to observe effects that cannot be explained in terms of the Hamiltonian (A-1). 
Therefore, we shall take into account the corrections we have just mentioned by 
writing the complete hydrogen atom Hamiltonian in the form: 


H=H,+W (A-3) 


where H, is given by (A-1) and where W represents all the terms neglected thus far. 
Since W is much smaller than Hp, it is possible to calculate its effects by using the 
perturbation theory presented in chapter XI. This is what we propose to do in this 
chapter. We shall show that W is responsible for a “fine structure”, as well as for a 

“hyperfine structure” of the various energy levels calculated in chapter VII. 
Furthermore, these structures can be measured experimentally with very great 
accuracy (the hyperfine structure of the Is ground state of the hydrogen atom is the 
physical quantity currently known to the largest number of significant figures). 
We shall also consider, in this chapter and its complements, the influence of an 
external static magnetic or electric field on the various levels of the hydrogen atom 
(the Zeeman effect and the Stark effect). 

This chapter actually has two goals. On the one hand, we want to use a concrete 
and realistic case to illustrate the general stationary perturbation theory discussed 
in the preceding chapter. On the other hand, this study, which bears on one of the 
most fundamental systems of physics (the hydrogen atom), brings out certain 
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concepts which are basic to atomic physics. For example. $B is devoted to a 
thorough discussion of various relativistic and magnetic corrections. This chapter, 
while not indispensable for the study of the last two chapters, presents concepts 
fundamental to atomic physics. 


B. ADDITIONAL TERMS IN THE HAMILTONIAN 


The first problem to be solved obviously consists of finding the expression for W. 


1. The fine-structure Hamiltonian 


a. THE DIRAC EQUATION IN THE WEAKLY RELATIVISTIC DOMAIN 


In chapter IX, we mentioned that the spin appears naturally when we try to 
establish an equation for the electron which satisfies both the postulates of special 
relativity and those of quantum mechanics. Such an equation exists: it is the 
Dirac equation, which makes it possible to account for numerous phenomena (elec- 
tron spin, the fine structure of hydrogen, etc.) and to predict the existence of posi- 
trons. 

The most rigorous way of obtaining the expression for the relativistic 
corrections [appearing in the term W of (A-3)] therefore consists of first writing 
the Dirac equation for an electron placed in the potential V(r) created by the proton 
(considered to be infinitely heavy and motionless at the coordinate origin). One then 
looks for its limiting form when the system is weakly relativistic, as is the case 
for the hydrogen atom. We then recognize that the description of the electron state 
must include a two-component spinor (cf. chap. IX, $C-1). The spin operators S.. 
S, S, introduced in chapter IX then appear naturally. Finally, we obtain an 
expression such as (A-3) for the Hamiltonian H, in which W appears in the form of 
a power series expansion in v/c which we can evaluate. 

It is out of the question here to study the Dirac equation, or to establish its 
form in the weakly relativistic domain. We shall confine ourselves to giving the 
first terms of the power series expansion in v/c of W and their interpretation. 


2 p+ h? 
H = m E = - fe ee, A E +——~ AV(R) +... 
2m, 8mic? 2m R dR 8m¿c? 
Y———y o _/ ae yor af ¿AR y ed 
Hy mi Wso Wp 


(B-1) 


We recognize in (B-1) the rest-mass energy m,c? of the electron (the first term) 
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and the non-relativistic Hamiltonian H, (the second and third terms)*. The 
following terms are called fine structure terms. 


COMMENT: 


Note that it is possible to solve the Dirac equation exactly for an electron placed 
in a Coulomb potential. We thus obtain the energy levels of the hydrogen atom without 
having to make a limited power series expansion in v/c of the eigenstates and eigenvalues 
of H. The “perturbation” point of view which we are adopting here is, however, very 
useful in bringing out the form and physical meaning of the various interactions which 
exist inside an atom. This will later permit a generalization to the case of many-electron 
atoms (for which we do not know how to write the equivalent of the Dirac equation). 


b. INTERPRETATION OF THE VARIOUS TERMS 
OF THE FINE-STRUCTURE HAMILTONIAN 


a. Variation of the mass with the velocity (W,,, term) 


(i) The physical origin 

The physical origin of the W,,, term is very simple. If we start with the 
relativistic expression for the energy of a classical particle of rest-mass m, and 
momentum p: 


E= exp? + me? (B-2) 


and perform a limited expansion of E in powers of |p|/m,c, we obtain : 


E = m? T re 
d 2m,  8m3e? 


e 


(B-3) 


In addition to the rest-mass energy (m,c?) and the non-relativistic kinetic energy 
(p?/2m,), we find the term — p*/8m3c?, which appears in (B-1). This term represents 
the first energy correction, due to the relativistic variation of the mass with the 
velocity. 

(ii) Order of magnitude 

To evaluate the size of this correction, we shall calculate the order of 
magnitude of the ratio W,,/H,: 


p4 
: IN 2 2 2 
AOS GO (2) as (=) (B-4) 
Ho p? 4m? 4e 137 

2m 


* Expression (B-1) was obtained by assuming the proton to be infinitely heavy. This is why it is 
the mass m, of the electron that appears, and not, as in (A-1), the reduced mass u of the atom. As far as 
H, is concerned, the proton finite mass effect is taken into account by replacing m, by u. However, 
we shall neglect this effect in the subsequent terms of H, which are already corrections. It would, moreover, 
be difficult to evaluate, since the relativistic description of a system of two interacting particles poses 
serious problems [it is not sufficient to replace m, by p in the last terms of (B-1)]. 
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since we have already mentioned that, for the hydrogen atom, v/e = a. Since 
Ho = 10 eV, we see that W,, = 107? eV. 


B.  Spin-orbit coupling (Wo, term) 


(i) The physical origin 
The electron moves at a velocity v = p/m, in the electrostatic field E created 
by the proton. Special relativity indicates that there then appears, in the electron 
frame, a magnetic field B’ given by: 
I 


B= —-—=vxE (B-5) 
c? 


to first order in v/c. Since the electron possesses an intrinsic magnetic moment 
M; = 4S/m,, it interacts with this field B’. The corresponding interaction energy 
can be written: 


W = —M,.B’ (B-6) 

Let us express W’ more explicitly. The electrostatic field E appearing in (B-5) is 
1 dV(r 

equal to — er where V(r) = — < is the electrostatic energy of the electron. 


From this, we get: 


1 1dV(r) p "e: (B-7) 
qer d m 


e 


In the corresponding quantum mechanical operator, there appears: 


PxR=-L (B-8) 
Finally, we obtain: 


mat LAR a tes (B-9) 
mic? R dR me? R? 


Thus we find, to within the factor 1/2*, the spin-orbit term Wg which appears 
in (B-1) This term then represents the interaction of the magnetic moment of 
the electron spin with the magnetic field “seen” by the electron because of its 
motion in the electrostatic field of the proton. 

(ii) Order of magnitude 

Since L and S are of the order of h, we have: 


E (B-10) 


* It can be shown that the factor 1/2 is due to the fact that the motion of the electron about the 
proton is not rectilinear. The electron spin therefore rotates with respect to the laboratory reference 
frame (Thomas precession: see Jackson (7.5) section 11-8, Omnès (16.13) chap. 4 $2. or Bacry (10,31) 
chap. 7 $5-d). 
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Let us compare W,, with Hy, which is of the order of e*/R: 


eh? 
Woo micR? h? 
Ho e? mc? R? 
R 


y. The Darwin term Wp 
(i) The physical origin 


(B-11) 


(B-12) 


In the Dirac equation, the interaction between the electron and the Coulomb 
field of the nucleus is “local”; it only depends on the value of the field at the 
electron position r. However, the non-relativistic approximation (the series expansion 
in v/c) leads, for the two-component spinor which describes the electron state, to an 

' equation in which the interaction between the electron and the field has become 
non-local. The electron is then affected by all the values taken on by the field in a 
domain centered at the point r, and whose size is of the order of the Compton wave 
length h/m,c of the electron. This is the origin of the correction represented by the 


Darwin term. 


To understand this more precisely, assume that the potential energy of the electron, 


instead of being equal to V(r), is given by an expression of the form: 


[ee So) Vir +p) 


(B-13) 


where f (p) is a function whose integral is equal to 1, which only depends on |p|, and which takes 


on significant values only inside a volume of the order of (h/m,c)?, centered at p = 0. 


If we neglect the variation of V(r) over a distance of the order of h/m,c, we can replace 
V(r + p) by V(r) in (B-13) and take V(r) outside the integral, which is then equal to 1. 


(B-13) reduces, in this case, to V(r). 


A better approximation consists of replacing, in (B-13), V(r + p) by its Taylor series 
expansion in the neighborhood of p = 0. The zeroeth-order term gives V(r). The first-order term 
is zero because of the spherical symmetry of f (p). The second-order term involves the second 
derivatives of the potential energy V(r) at the point r and quadratic functions of the components 


of p, weighted by f(p) and integrated over d?p. This leads to a result of the order of 


(him.c)’AV(r) 
It is therefore easy to accept the idea that this second-order term should be the Darwin term. 
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(ii) Order of magnitude 
Replacing V(R) by — e?/R, we can write the Darwin term in the form : 


ag ($) IE (B-14) 


> # 
8m2zc? AR 2m3c? 


(we have used the expression for the Laplacian of 1/R given by formula (61) of 
appendix H). 

When we take the mean value of (B-14) in an atomic state, we find a contri- 
bution equal to: 

reh? > 


(0)? 


22 
2m0 


where y(0) is the value of the wave function at the origin. The Darwin term therefore 
affects only the s electrons, which are the only ones for which w(0) #0 
(cf. chap. VI, $ C-4-c). The order of magnitude of |y(0)|? can be obtained by taking 
the integral of the square of the modulus of the wave function over a volume of the 
order of aj (where «, is the Bohr radius) to be equal to 1. Thus we obtain : 

me? 


yo? =>. = (B-15) 


až h° 


which gives the order of magnitude of the Darwin term: 


ne*h? > > 2 
W, = 0) = mcr = m, cta? (B-16 
p 2m20? h40) hte ) 
Since Hy = m,c?a*, we again see that: 
Ws ia 
oe = (=) (B-17) 


Thus. all the fine structure terms are about 10* times smaller than the non- 
relativistic Hamiltonian of chapter VII. 


2. Magnetic interactions related to proton spin: 
the hyperfine Hamiltonian 


a. PROTON SPIN AND MAGNETIC MOMENT 


Thus far, we have considered the proton to be a physical point of mass M, 
and charge y, = — q. Actually, the proton, like the electron, is a spin 1/2 particle. 
We shall denote by I the corresponding spin observable. 

With the spin I of the proton is associated a magnetic moment M,. However, 
the gyromagnetic ratio is different from that of the electron: 


M; = 9,4, Uh (B-18) 
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where y, is the nuclear Bohr magneton: 


u E (B-19) 
"9M, 


and the factor g,, for the proton, is equal to:g, = 5.585. Because of the presence 
of M, (the proton mass) in the denominator of (B-19), y, is close to 2 000 times 
smaller than the Bohr magneton yp (recall that y = gh/2m,). Although the angular 
momenta of the proton and the electron are the same, nuclear magnetism, because 
of the mass difference, is much less important than electronic magnetism. The 
magnetic interactions due to the proton spin I are therefore very weak. 


b. THE MAGNETIC HYPERFINE HAMILTONIAN W,; 


The electron moves, therefore, not only in the electrostatic field of the proton, 
but also in the magnetic field created by M,. When we introduce the corresponding 
vector potential into the Schródinger equation*, we find that we must add to the 
Hamiltonian (B-1) an additional series of terms for which the expression is 
(cf. complement Axır): 


=- e 1 ak: Ms.M 
W, = o L.M; +=; [3(Ms.m\M,.m) ~ Ms. My] 
8x E 
+= Ms. M, ó(R) \ (B-20) 


Mi, is the spin magnetic moment of the electron, and n is the unit vector of the 
straight line joining the proton to the electron (fig. 1). 


Ms 
e 
FIGURE | 
n 
Relative disposition of the magnetic 
M, moments M, and M, of the proton and 
K the electron; n is the unit vector on the 
pN line joining the two particles. 


We shall see that W,, introduces energy shifts which are small compared to 
those created by W,. This is why W, y 1s called the “hyperfine structure Hamiltonian ”. 


* Since the hyperfine interactions are very small corrective terms, they can be found using the 
non-relativistic Schródinger equation. 
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c. INTERPRETATION OF THE VARIOUS TERMS OF W,; 


The first term of W,, represents the interaction of the nuclear magnetic 
moment M, with the magnetic field (1,/47)qL/m,r* created at the proton by the 
rotation of the electronic charge. 

The second term represents the dipole-dipole interaction between the 
electronic and nuclear magnetic moments: the interaction of the magnetic moment 
of the electron spin with the magnetic field created by M, (cf. complement B,,) 
or vice versa. 

Finally, the last term, also called Fermi’s “contact term”, arises from the 
singularity at r = 0 of the field created by the magnetic moment of the proton. 
In reality, the proton is not a point. It can be shown (cf. complement Ax) that 
the magnetic field inside the proton does not have the same form as the one created 
outside by M, (and which enters into the dipole-dipole interaction). The contact 
term describes the interaction of the magnetic moment of the electron spin with 
the magnetic field inside the proton (the “delta” function expresses the fact that 
this contact term exists, as its name indicates, only when the wave functions of the 
electron and proton overlap). 


d. ORDERS OF MAGNITUDE 


It can easily be shown that the order of magnitude of the first two terms 
of Wpis: 


qh? pw eh 1 


=== = B-21 
m,M,R? 4r mM ,c? R? l } 


By using (B-10), we see that these terms are about 2 000 times smaller than Woo. 
As for the last term of (B-20), it is also 2 000 times smaller than the Darwin 
term, which also contains a 6(R) function. 


C. THE FINE STRUCTURE OF THE n = 2 LEVEL 


1. Statement of the problem 


a. DEGENERACY OF THE n = 2 LEVEL 


We saw in chapter VII that the energy of the hydrogen atom depends only 
on the quantum number n. The 2s (n = 2, / = 0) and 2p (n = 2, | = 1) states 
therefore have the same energy, equal to: 


If the spins are ignored, the 2s subshell is composed of a single state, and 
the 2p subshell of three distinct states which differ by their eigenvalue m,h of the 
component L, of the orbital angular momentum L (m; = 1,0, — 1). Because of the 
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existence of electron and proton spins, the degeneracy of the n = 2 level is higher 
than the value calculated in chapter VII. The components S, and J, of the two spins 
can each take on two values : mz = + 1/2,m, = + 1/2. One possible orthonormal 
basis in the n = 2 level is then : 


1 
fir-s om =0:m=¿3m= Eg y (C-1) 


(2s subshell, of dimension 4) 


| In=231= tim, = - 1,0, +1; mon tymto} (C-2) 


(2p subshell, of dimension 12). 


The n = 2 shell then has a total degeneracy equal to 16. 

According to the results of chapter X1 ($C), in order to calculate the effect 
of a perturbation W on the n = 2 level, it is necessary to diagonalize the 
16 x 16 matrix representing the restriction of W to this level. The eigenvalues of 
this matrix are the first order corrections to the energy, and the corresponding 
eigenstates are the eigenstates of the Hamiltonian to zeroeth order. 


b. THE PERTURBATION HAMILTONIAN 


In all of this section, we shall assume that no external field is applied to the 
atom. The difference W between the exact Hamiltonian H and the Hamiltonian A, 
of chapter VII ($C) contains fine structure terms, indicated in §B-1 above : 


W, = War + Woo + Wp (C-3) 
and hyperfine structure terms W,,, introduced in $ B-2. We thus have: 
W=W +W; (C-4) 


Since W, is close to 2 000 times larger than W,, (cf. §B-2-d), we must obviously 
begin By studying the effect of W,, before considering that of W,,, on the n = 2 level. 
We shall see that the n = 16 e of this level is partially removed by W, 
The structure which appears in this way is called the “fine structure” 

W,, May then remove the remaining degeneracy of the fine structure levels 
and cause a “hyperfine structure” to appear inside each of these levels. 

In this section (§C), we shall confine ourselves to the study of the fine structure 
of the n = 2 level. The calculations can easily be generalized to other levels. 
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2. Matrix representation of the fine-structure Hamiltonian W, 
inside the n = 2 level 


a. GENERAL PROPERTIES 


The properties of W,, as we shall see, enable us to show that the 16 x 16 matrix 
which represents it in the n = 2 level can be broken down into a series of square 
submatrices of smaller dimensions. This will considerably simplify the determi- 
nation of the eigenvalues and eigenvectors of this matrix. 


o. W, does not act on the spin variables of the proton 


We see from (B-1) that the fine structure terms do not depend on I. It follows 
that the proton spin can be ignored in the study of the fine structure (afterwards, 
we multiply by 2 all the degrees of degeneracy obtained). The dimension of the 
matrix to be diagonalized therefore falls from 16 to 8. 


p. W, does not connect the 2s and 2p subshells 


Let us first prove that L? commutes with W,. The operator L? commutes with 
the various components of L, with R (L? acts only on the angular variables), with P? 
[cf. formula (A-16) of chapter VII], and with S (L? does not act on the spin 
variables). L? therefore commutes with W, (which is proportional to P*), with Wo 
(which depends only on R, L, S), and with W, (which depends only on R). 

The 2s and 2p states are eigenstates of L? with different eigenvalues (0 and 2h’), 
Therefore, W, which commutes with L?, has no matrix elements between a 2s state 
and a 2p state. The 8 x 8 matrix representing W, inside the n = 2 level can be 
broken down, consequently, into a 2 x 2 matrix relative to the 2s state and 


a6 x 6 matrix relative to the 2p state : 


SS 


SS 
SENS 
Ra 
OS S 
M 
` ~ . 
R 
sS 
> 
si 


COMMENT: 


The preceding property can also be considered to be a consequence of 
the fact that W, is even. Under a reflection, R changes to — R (R = |R| 
remains unchanged), P, to — P, L to L, and S to S. It is then easy to see 
that W, remains invariant. W, therefore has no matrix elements between 
the 2s and 2p states, which are of opposite parity (cf. complement F,,). 
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b. MATRIX REPRESENTATION OF W, IN THE 2s SUBSHELL 


The dimension 2 of the 2s subspace is the result of the two possible values 
m, = + 1/2 of S, (since we are ignoring /, for the moment). 

W , and W, do not depend on S. The matrices which represent these two 
operators in the 2s subspace are therefore multiples of the unit matrix, with pro- 
portionality coefficients equal, respectively, to the purely orbital matrix elements: 

4 


(n=2:1=0;m, = 0] — ja = 2:1=O0:m, =0> 


2 
Sige? 


and: 
2 


<n=2:1=0:m, =0| - AV(R)|n = 2:1=0:m, =0> 


h- 
8m2c 


Since we know the eigenfunctions of Hy, the calculation of these matrix elements 
presents no theoretical difficulty. We find (cf. complement B,,,): 


3 

C Wae Dag = a (C-5) 
l 2,4 

< W, das = 16 nC X (C-6) 


Finally, calculation of the matrix elements of Wo involves “angular” matrix 
elements of the form ¢/= 0, m, = 0|L,,.|/= 0, m, = 0), which are zero 
because of the value / = 0 of the quantum number /. Therefore: 


< Wo >2s = 0 (C-7) 


Thus, under the effect of the fine structure terms, the 2s subshell is shifted 
as a whole with respect to the position calculated in chapter VII by an amount 
equal to — 5m,c?a*/128. 


c. MATRIX REPRESENTATION OF W, IN THE 2p SUBSHELL 


a. W, and W, terms 


The W, and W, terms commute with the various components of L, since 
L acts only on the angular variables and commutes with R and P? (which depends 
on these variables only through L?: cf. chapter VI). L therefore commutes with W, . 
and W. Consequently, W, and Wp are scalar operators with respect to the orbital 
variables (cf. complement By,. $5-b). Since W, and Wp do not act on the spin 
variables, it follows that the matrices which represent MW, and W, inside the 


2p subspace are multiples of the unit matrix. The calculation of the proportionality 
coefficient 1s given in complement B,,, and leads to: 


7 > 
< Wane 23p =< 384 mc a" (C-8) 


< Wp >a, =0 (C-9) 
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The result (C-9) is due to the fact that W, is proportional to 6:R) and can therefore 
have a non-zero mean value only in an s state (for / > 1, the wave function is zero at 
the origin). 


B. Wo term 


We must calculate the various matrix elements: 


1 
(n=2;l=1l;s=>3;M,Ms 


j 1 
5 &(R)L.S|[n = 2:1 = 1:8 ==; m, ms, > 


2 


(C-10) 
with: 
e? 1 


2m? R? 


(C-11) 


If we use the { |r > } representation, we can separate the radial part of matrix 
element (C-10) from the angular and spin parts. Thus we obtain : 


1 l 


Ex, l= Lis = 53M; Ms L.S|l=1:s = 53 ML: ms > (C-12) 
where €,, is a number, equal to the radial integral: 
> e? l a . 
ča =3334 Rail)? o dr (C-13) 
me Jo r 


Since we know the radial function R,,(r) of the 2p state, we can calculate ¿,,. We 
find (cf. complement By yy): 


z l 2,4 
62p = 3 mcm C-14 
"> 48m? l ) 
The radial variables have therefore disappeared. According to (C-12). the 
problem is reduced to the diagonalization of the operator ¿,,L . S, which acts only 
on the angular and spin variables. 
To represent the operator ¿,,L. S by a matrix, several different bases can be 
chosen : 


— first of all, the basis: 


1 
Jira tis = :mims> } (C-15) 


which we have used thus far and which is constructed from common eigenstates 
of L?, S?, L,, S,: 
— or, introducing the total angular momentum: 
=L+S (C-16) 
the basis: 


[lates 


nl 


sim } (C-17) 
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constructed from the eigenstates common to L?, S?, J’, J,. According to the results 
of chapter X, since/ = land s = 1/2, J can take on two values : J = | + 1/2 = 3/2 
and J = 1 — 1/2 = 1/2. Furthermore, we know how to go from one basis to the 
other, thanks to the Clebsch-Gordan coefficients [formulas (36) of complement A, ]. 

We shall now show that the second basis (C-17) is better adapted than the 
first one to the problem which interests us here, since Cap . S is diagonal in the 
basis (C-17). To see this, we square both sides of (C-16). We find (Land S commute): 


P=(L4+SP=L74+874+2L.8 (C-18) 
which gives : 
1 2 
dp L.S=> Esp (J? — L? — S?) (C-19) 


Each of the basis vectors (C-17) is an eigenstate of L?, S?, J?; we thus have: 


We see from (C-20) that the eigenvalues of ¿,,L . S depend only on J and not 
on m,; they are equal to: 


1 3 3 3 1 
262 2 -3P E Čap h- = gg mec (C-21) 


for J = 1/2, and: 
1 1 3 1 1 
tna —2- af = +5 č, ħ? =a mat (C-22) 


for J = 3/2. 

The six-fold degeneracy of the 2p level is therefore partially removed by Wo- 
We obtain a four-fold degenerate level corresponding to J = 3/2, and a two-fold 
degenerate level corresponding to J = 1/2. The (2/ + 1)-fold degeneracy of each 
J state is an essential degeneracy related to the rotation invariance of W,. 


COMMENTS: 


(i) In the 2s subspace (/ = 0, s = 1/2), J can take on only one value, 
J=0+4 1/2 = 1/2. 


(ii) Inthe 2p subspace, W,,, and W, are represented by multiples of the unit matrix. 
This property remains valid in any basis since the unit matrix is invariant 
under a change of basis. The choice of basis (C-17), required by the Wo term, 
is therefore also adapted to the MW, and W, terms. 
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C. THE FINE STRUCTURE OF THE a = 2 LEVEL 


3. Results: the fine structure of the n = 2 level 


a. SPECTROSCOPIC NOTATION 


In addition to the quantum numbers n, / (and s), the preceding discussion 
introduced the quantum number J on which the energy correction due to the 
spin-orbit coupling term depends. 

For the 2s level, J = 1/2; for the 2p level, J = 1/2 or J = 3/2. The level asso- 
ciated with a set of values, n, /, J is generally denoted by adding an index J to the 
symbol representing the (n, /) subshell in spectroscopic notation (cf. chap. VII. 
§ C-4-b): 


nly (C-23) 


where / stands for the letter s for / = 0, p for l = 1,dfor/ = 2, f for / = 3... Thus, 
the n = 2 level of the hydrogen atom gives rise to the 2s,,5, 2p,,, and 2p3,, levels. 


b. POSITIONS OF THE 2s,;2, 2p,.. AND 2p,, LEVELS 


By regrouping the results of $ 2, we can now calculate the positions of the 2s, ,,, 
2p , ¡, and 2p,,, levels with respect to the “unperturbed” energy of the n = 2 level 
calculated in chapter VII and equal to — uc?a?/8. 

According to the results of $ 2-b, the 2s,,, level is lowered by a quantity equal 


to: 
-— metas (C-24) 
128 "e Í 
According to the results of $ 2-c, the 2p, ,, level is lowered by a quantity equal 
to: 
7 1 2.4 5 2,4 
=== —-— lat = — ——m,c? C-25 
( 384 im) i pares ve 


Thus we see that the 2s,,, and 2p,,, levels have the same energy. According to the 
theory presented here, this degeneracy must be considered to be accidental, as 
opposed to the essential (2/ + 1)-fold degeneracy of each J level. 

Finally, the 2p3,, level is lowered by a quantity: 


7 l 24 ol 2,4 
es +36) ar = 128 MC X (C-26) 


The preceding results are shown in figure 2. 


COMMENTS: 


(i) Only the spin-orbit coupling is responsible for the separation between the 
2pı;2 and 2p;,, levels, since W,, and Wp shift the entire 2p level as a whole. 
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(ii) The hydrogen atom can go from the 2p state to the ls state by emitting 
a Lyman a photon (å = 1216 A). The material presented in this chapter 
shows that, because of the spin-orbit coupling, the Lyman «a line actually 
contains two neighboring lines*, 2p,,, —=> 1s,,. and 2p3,. —> 15, 2, sepa- 
rated by an energy difference equal to: 


—~ m,e’ xt = =m,c%a 


128 


When they are observed with a sufficient resolution, the lines of the hydrogen 
spectrum therefore present a “fine structure”. 


25 1/2 2P 1/2 


FIGURE 2 


Fine structure of the n = 2 level of the hydrogen atom. Under the effect of the fine structure 
Hamiltonian W, the n = 2 level splits into three fine structure levels, written 2s,,,,2p,,, and 2p, 7. 
We have indicated the algebraic values of the shifts, calculated to first order in W,. The shifts are the 
same for the 2s,,, and 2p,,, levels (a result which remains valid, moreover, to all orders in W). 
When we take into account the quantum mechanical nature of the electromagnetic field, we find 
that the degeneracy between the 2s,,, and 2p,,, levels is removed (the Lamb shift; see figure 4). 


(iii) We see in figure 2 that the two levels with the same J have the same energy. This result 
is not merely true to first order in W,: it remains valid to all orders. The exact solution 
of the Dirac equation gives, for the energy of a level characterized by the quantum 
numbers a, /, s, J, the value: 


2 


2 
to o 
En = mel + a(n =J- 5 + YU + 1⁄2} — 2) | (C-27) 


* In the ground state, / = 0 and s = 1/2, so J can take on only the value J = 1/2. W, therefore 
does not remove the degeneracy of the ls state, and there is only one fine structure level, the 1s,,, level. 
This is a special case, since the ground state is the only one for which / is necessarily zero. This is why 
we have chosen here to study the excited n = 2 level. 
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We see that the energy depends only on n and J, and not on /. 
If we make a limited expansion of formula (C-27) in powers of a, we obtain: 


1 1 me n 3 
Exe mar (A ja to. (C-28) 


The first term is the rest-mass-energy of the electron. The second term follows from the 
theory of chapter VII. The third term gives the correction to first order in W, calculated 
in this chapter. 

(iv) Even in the absence of an external field and incident photons, a fluctuating electro- 
magnetic field must be considered to exist in space (cf. complement Ky, $ 3-d-6). This 
phenomenon is related to the quantum mechanical nature of the electromagnetic field, 
which we have not taken into consideration here. The coupling of the atom with these 
fluctuations of the electromagnetic field removes the degeneracy between the 2s,,, 
and 2p,/. levels. The 2s,,, level is raised with respect to the 2p,,, level by a quantity 
called the “Lamb shift” which is of the order of 1 060 MHz (fig. 4. page 1231). 

The theoretical and experimental study of this phenomenon, which was discovered 
in 1949, has been the object of a great deal of research, leading to the development of 
modern quantum electrodynamics. 


D. THE HYPERFINE STRUCTURE OF THE n = 1 LEVEL 


It would now seem logical to study the effect of M,, inside the fine structure levels 
251 ¡23 2p 1 2 and 2p3,,, in order to see if the interactions related to the proton spin I 
cause a hyperfine structure to appear in each of these levels. However, since W, does 
not remove the degeneracy of the ground state ls, it is simpler to study the effect 
of W,; on this state. The results obtained in this special case can easily be generalized 
to the 2s, 5, 2p,,, and 2p3,, levels. 


1. Statement of the problem 
a. THE DEGENERACY OF THE 1s LEVEL 

For the 1s level. there is no orbital degeneracy (/ = 0). On the other hand, 
the S, and /, components of S and I can still take on two values: mg = + 1/2 


and m, = + 1/2. The degeneracy of the 1s level is therefore equal to 4, and a possible 
basis in this level is given by the vectors: 


l 
{Ina tits 02m, = 0m tgm h (D-1) 


b. THE 1s LEVEL HAS NO FINE STRUCTURE 
We shall show that the W, term does not remove the degeneracy of the Is level. 
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The MW, and W, terms do not act on my and m,, and are represented in the 
ls subspace by multiples of the unit matrix. We find (cf. complement B,,,): 


È< Wae Dis = — F me’ at (D-2) 


1 
< Wpis = > mc? at (D-3) 


Finally, calculation of the matrix elements of the Wo term involves the “angular” 
matrix elements <l = 0, m, = 0|L,,,|/ = 0, m, = 0), which are obviously 
zero (/ = 0); therefore: 


< Wo?is = 0 (D-4) 


XYZ 


In conclusion, W, merely shifts the Is level as a whole by a quantity equal to: 


lo, 
(- 2 + ¿nes =e m, cto? (D-5) 


without splitting the level. This result could have been foreseen : since / = 0 and 
s = 1/2, J can take on only one value, J = 1/2, and the Is level therefore gives rise 
to only one fine structure level, ls, ,,. 

Since the Hamiltonian W, does not split the 1s level, we can now consider the 
effect of the My, term. To do so, we must first calculate the matrix which represents W, , 
in the 1s level. 


2. Matrix representation of W,, in the 1s level 


a. TERMS OTHER THAN THE CONTACT TERM 


Let us show that the first two terms of W,, [formula (B-20)] make no contri- 
bution. l 


Ho 4 
4r m,R? 
to the “angular” matrix elements < / = 0, m, = 0|L|/ = 0, m, = 0 >, which are 
obviously zero (/ = 0). 

Similarly, it can be shown (c/. complement B,,, $ 3) that the matrix elements 
of the second term (the dipole-dipole interaction) are zero because of the spherical 
symmetry of the Is state. 


Calculation of the contribution from the first term, — L.M,, leads 


b. THE CONTACT TERM 


The matrix elements of the last term of (B-20), that is, of the contact term, 
are of the form: 


<n =1;1=0;m, = 0; m; m | 


_ 2Ho 


3 Ms . M, ó(R)|n =1:1=0;m, =0;m5;m,> (D-6) 
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If we go into the {|r > } representation, we can separate the orbital and 
spin parts of this matrix element and put it in the form: 


A < mg: m, | E.S | mg; m > (D-7) 
where . is a number given by: 


y = E ¿n= 1;1=0;m, = 0] 0(R)!n = 1;1=0;m, =0> 
eye mM, 


E ee 
3e¢7 m, M40 
-3 
4 a ye 
= É y, Te meat e) (D-8) 
3°? M, M,) We 


We have used the expressions relating M, and M, to S and I [cf. (B-18)], as well as 

the expression for the radial function R,9(r) given in §C-4-c of chapter VII*. 
The orbital variables have therefore completely disappeared, and we are left 

with a problem of two spin 1/2°s, Land S, coupled by an interaction of the form: 


ATS (D-9) 


where .c/ is a constant. 


c. EIGENSTATES AND EIGENVALUES OF THE CONTACT TERM 


To represent the operator .4/ I. S, we have thus far considered only the basis: 


l 
lis memo b (D-10) 


formed by the eigenvectors common to S”, I’, S., /,. We can also, by introducing 
the total angular momentum **: 


F=Sil (D-11) 


— 


use the basis: 


fisey egim } (D-12) 


formed by the eigenstates common to S?, I’, F?, F,. Since s = I = 1/2, F can take 
on Only the two values F = 0 and F = |. We can easily pass from one basis to the 
other by means of (B-22) and (B-23) of chapter X. 


* The factor (1 +m /M,)" 3 in (D-8) arises from the fact that it is the reduced mass u which 
enters into R,y(0). lt so happens that, for the contact term, it is correct to take the nuclear finite mass 


effect into account in this way. 
** The total angular momentum is actually F = L + S + I, that is, F = J + 1. However, for 
the ground state, the orbital angular momentum is zero, so F reduces to (D-11). 
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The { | F, mp > } basis is better adapted than the { | ms, m, > } basis to the 
study of the operator .ZI. S, as this operator is represented in the { | F, m, > ) 
basis by a diagonal matrix (for the sake of simplicity, we do not explicitly write 
s = 1/2 and J = 1/2). This is true, since we obtain, from (D-11): 


AS = ZFP- P — S?) (D-13) 


It follows that the states | F, mp > are eigenstates of Z I. S: 


Ah? 
2 


AI.S|F,mp) = [F(F +1) — I +1)— S(S +1)]|F,mp> (D-14) 


We see from (D-14) that the eigenvalues depend only on F, and not on mp. They are 
equal to: 


Ah? 3 3] Ate 
2 2-3 E (pe) 
for F = 1, and: 
Ah? 3 3 3Ah? 
Eo 4 ] 4 pesto] 
for F = 0. 


The four-fold degeneracy of the Is level is therefore partially removed by MW;,,. 
We obtain a three-fold degenerate F = 1 level and a non-degenerate F = 0 level. 
The (2F + 1)-fold degeneracy of the F = 1 level is essential and is related to the 
invariance of Wp under a 1otation of the total system. 


3. The hyperfine structure of the 1s level 


a. POSITIONS OF THE LEVELS 


Under the effect of W., the energy of the Is level is lowered by a quantity 
m,c7a4/8 with respect to the value — juc%a?/2 calculated in chapter VIL. W, then 
splits the 1s,,, level into two hyperfine levels. separated by an energy </h’ (fig. 3). 


fh? is often called the “hyperfine structure of the ground state”. 
COMMENT: 


It could be found, similarly, that W, splits each of the fine structure 
levels 25,,5, 2p,,. and 2p,,, into a series of hyperfine levels, corresponding to 
all the values of F separated by one unit and included between J + / and 
|J — I|. For the 2s, , and 2p,,, levels, we have J = 1/2. Therefore, F takes 
on the two values F = | and F = 0. For the 2p,,, level, J = 3/2, and, conse- 
quently, we have F = 2 and F = 1 (cf: fig. 4). 
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Is FIGURE 3 


The hyperfine structure of the 


n = | level of the hydrogen 
atom. Under the effect of W,, 
the 7 = | level undergoes 


a global shift equal to 
1 — mata? 8; 


J can take on only one value, 
J = 12, When the hyperfine 
coupling W,, is taken into 
account, the 1s,,, level splits 


F=1 into two hyperfine levels, F = | 
,— and F = 0. The hyperfine tran- 
f = Ah? sition 
j 
A i F=] <> F=) 
Ls 1/2 \ 
i Sh? (the 21 cm line studied in 
\ 3 ? radioastronomy) has a fre- 
\ | = 4 Lh quency which is known experi- 
Ñ i mentally to twelve significant 
\ t figures (thanks to the hydrogen 
F=0 maser). 
= F=2 
| 2P 3;2 b F=1 
Z 
Z 
Z F=1 
“a 
S CRE 
: F=1 
2P 1.2 N F=0 
HGURE 4 


The hyperfine structure of the n = 2 level of the hydrogen atom. The separation ~ between the 
two levels 2s, ,, and 2p,,, is the Lamb shift, which is about ten times smaller than the fine structure 
splitting AE separating the two levels 2p, , and 2p, , (4 = 1057.8 MHz: AE = 10969,1 MHz). 
When the hyperfine coupling <>, is taken into account, each level splits into two hyperfine sublevels 
(the corresponding value of the quantum number F is indicated on the right-hand side of the figure). 
The hyperfine splittings are equal to 23.7 MHz for the 2p, , level, 177.56 MHz for the 2s, , level 
and 59,19 MHz for the 2p, , level (for the sake of clarity, the figure is not drawn to scale), 
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b. IMPORTANCE OF THE HYPERFINE STRUCTURE OF THE 1s LEVEL 


The hyperfine structure of the ground state of the hydrogen atom is currently 
the physical quantity which is known experimentally to the highest number of 
significant figures. Expressed in Hz, it is equal to*: 

Ah 


a A 1 420 405 751. 768 + 0.001 Hz (D-17) 


Such a high degree of experimental accuracy was made possible by the 
development of the “hydrogen maser” in 1963. The principle of such a device is, 
very schematically, the following: hydrogen atoms, previously sorted (by a magnetic 
selection of the Stern-Gerlach type) so as to choose those in the upper hyperfine 
level F = 1, are stored in a glass cell (the arrangement is similar to the one shown 
in figure 6 of complement F,y). This constitutes an amplifying medium for the 
E(F = 1) — E(F = 0) 

h 
to the hyperfine frequency, and if the losses of the cavity are small enough for the 
gain to be greater than the losses, the system becomes unstable and can oscillate : 
we Obtain an “atomic oscillator” (a maser). The frequency of the oscillator is very 
stable and of great spectral purity. [ts measurement gives directly the value of 
the hyperfine splitting, expressed in Hz. 

Note, finally, that hydrogen atoms in interstellar space are detected in 
radioastronomy by the radiation they emit spontaneously when they fall from the 
F = 1 hyperfine level to the F = 0 hyperfine level of the ground state (this transition 
corresponds to a wave length of 21 cm). Most of the information we possess about 
interstellar hydrogen clouds is supplied by the study of this 21 cm line. 


hyperfine frequency . If the cell is placed in a cavity tuned 


E. THE ZEEMAN EFFECT 
OF THE 1s GROUND STATE HYPERFINE STRUCTURE 


1. Statement of the problem 
a. THE ZEEMAN HAMILTONIAN Wz 


We now assume the atom to be placed in a static uniform magnetic field By 
parallel to Oz. This field interacts with the various magnetic moments present in 


the atom : the orbital and spin magnetic moments of the electron, M, = s4 L and 
m, 
% : : 4 , 
M, = 7 S, and the magnetic moment of the nucleus, M, = — Al [ cf. expres- 
pP 


sion (B- i8)]. 


* The calculations presented in this chapter are obviously completely incapable of predicting 
all these significant figures. Moreover, even the most advanced theories cannot, at the present time, 
explain more than the first five or six figures of (D-17). 
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The Zeeman Hamiltonian W, which describes the interaction energy of the 
atom with the field B, can then be written: 


lL, + 28.) + o,f (E-1) 


where wọ (the Larmor angular frequency in the field B,) and œw, are defined by: 


q 
y = = 3,7 Bo (E-2) 
q 
O, = TM, JpBo (E-3) 


Since M, > m, we clearly have: 


[| > jonl (E-4) 


COMMENT: 


Rigorously, W, contains another term, which is quadratic in B, (the 
diamagnetic term). This term does not act on the electronic and nuclear spin 
variables and merely shifts the Is level as a whole, without modifying its 
Zeeman diagram, which we shall study later. Moreover. it is much smaller 
than (E-1). Recall that a detailed study of the effect of the diamagnetic term 
is presented in complement Dy- 


b. THE PERTURBATION "SEEN” BY THE 1s LEVEL 


In this section, we propose to study the effect of W, on the ls ground state 
of the hydrogen atom (the case of the n = 2 level is slightly more complicated since, 
in a zero magnetic field, this level possesses both a fine and a hyperfine structure, 
while the n = | level has only a hyperfine structure; the principle of the calculation 
is nevertheless the same). Even with the strongest magnetic fields that can be pro- 
duced in the laboratory, W, is much smaller than the distance between the Is level 
and the other levels: consequently, its effect can be treated by perturbation theory. 

The effect of a magnetic field on an atomic energy level is called the “ Zeeman 
effect”. When Bo is plotted on the x-axis and the energies of the various sublevels 
it creates are plotted on the y-axis, a Zeeman diagram is obtained. 

If B, is sufficiently strong. the Zeeman Hamiltonian W, can be of the same 
order of magnitude as the hyperfine Hamiltonian W,,*, or even larger. On the other 
hand, if By is very weak, W, < Wip Therefore, in general it is not possible to 
establish the relative importance of W, and W. To obtain the energies of the 
various sublevels. (W, + W,,) must be diagonalized inside the n = 1 level. 


* Recall that W; shifts the 1s level as a whole: it therefore also shifts the Zeeman diagram as 
a whole. 
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We showed in $ D-2 that the restriction of M;, to the n = 1 level could be put 
in the form .7 I . S. Using expression (E-1) for MW, we see that we must also calculate 
matrix elements of the form: 

(n=1;/=0;m, = 0: mg: m,|wo(L, + 28.) + o,f, 

ln =1;/=0:m, =O0:ms:m,>  (E-S) 


The contribution of w,L, is zero, since / and m, are zero. Since 20,5, + w,/, acts 
only on the spin variables, we can, for this term, separate the orbital part of the 
matrix element: 


(n=1;/=0;m, =0|n =1;/=0:m, =0>=1 (E-6) 


from the spin part. 
In conclusion, therefore, we must, ignoring the quantum numbers n, /, m,, 
diagonalize the operator : 


ALS + 2098. + pl (E-7) 


which acts only on the spin degrees of freedom. To do so, we can use either the 
[| ms, m; > y basis or the { | F, mp > } basis. 

According to (E-4), the last term of (E-7) is much smaller than the second one. 
To simplify the discussion, we shall neglect the term «,/_ from now on (it would be 
possible, however, to take it into account*). The perturbation “seen” by the 
ls level can therefore be written, finally: 


A1.S + 2095, (E-8) 


c. DIFFERENT DOMAINS OF FIELD STRENGTH 

By varying Bọ, we can continuously modify the magnitude of the Zeeman 
term 2woS,. We shall consider three different field strengths, determined by the 
respective orders of magnitude of the hyperfine term and the Zeeman term: 


(1) how, < Wh? : weak fields 

(ii) hw, > Wh? : strong fields 

(iii) ho, = £h? : intermediate fields 
We shall later see that it is possible to diagonalize operator (E-8) exactly. However, 
in order to give a particularly simple example of perturbation theory, we shall use 
a slightly different method in cases (/) and (ii). In case (i), we shall treat 21,5, like 
a perturbation with respect to z I. S. On the other hand, in case (i/), we shall treat 
Y I.S like a perturbation with respect to 2w,5,. The exact diagonalization of the 


set of two operators, indispensable in case (iii), will allow us to check the preceding 
results. 


2. The weak-field Zeeman effect 


The eigenstates and eigenvalues of. I. S have already been determined ($ D-2). 
We therefore obtain two different levels : the three-fold degenerate level, 


{|F=1:m, = —1,0,+ 1}, 


* This is what we do in complement Cy ¡¡, in which we study the hydrogen-like systems (muonium, 
positronium) for which it is not possible to neglect the magnetic moment of one of the two particles. 
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THE ZEEMAN EFFECT OF THE HYPERFINE STRUCTURE 


of energy .4h*/4, and the non-degenerate level, { | F = 0; mp = 0 > |, of energy 
— 3.v/ hi?*/4. Since we are treating 2w,5. like a perturbation with respect to ~ I. S. 
we must now separately diagonalize the two matrices representing 2:9,S, in the two 
levels, F = | and F = 0, corresponding to two distinct eigenvalues of ./ I.S. 


a. MATRIX REPRESENTATION OF S, IN THE {|F,m, >} BASIS 


Since we shall need it later. we shall begin by writing the matrix which 
represents S, in the {| F, mp > } basis (for the problem which concerns us here. 
it would suffice to write the two submatrices corresponding to the F = 1 and 


F = 0 subspaces). 


By using formulas (B-22) and (B-23) of chapter X, we easily obtain: 


S,|F = limp, = 
S| F =\:m, 
S |F = 13m, = 
S | F = 0; mp 


0> 


= 


= limp =1) 


bol st bol st 


(E-9) 


S| F = Limp =0) 


which gives the following expression for the matrix representing S. in the 
| F. my > } basis (the basis vectors are arranged in the order | 1.1 >, | 1.0). 


Lie = 1 0,07 


COMMENT: 


It is instructive to compare the preceding matrix with the one which represents F. 


in the same basis: 


(E-10) 


(E-11) 
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We see, first of all, that the two matrices are not proportional: the (F,) matrix 
is diagonal, while the (S,) one is not. 

However, if we confine ourselves to the restrictions of the two matrices in the F = 1 
subspace [limited by the darker line in expressions (E-10) and (E-11)]. we see that they 
are proportional. Denoting by P, the projector onto the F = | subspace (cf. comple- 
ment B,,), we have: 


1 
P, S, P, =5P, F, P, (E-12) 


It would be simple to show that the same relation exists between S, and F, on the one 
hand, and S, and F,, on the other. 

We have thus found a special case of the Wigner-Eckart theorem (complement Dy), 
according to which, in a given eigensubspace of the total angular momentum, all the 
matrices which represent vector operators are proportional. It is clear from this example 
that this proportionality exists only for the restrictions of operators to a given eigen- 
subspace of the total angular momentum, and not for the operators themselves. 

Moreover, the proportionality coefficient 1/2 which appears in (E-12) can be 
obtained immediately from the projection theorem. According to formula (30) of 
complement Dx, this coefficient is equal to: 
<S.F >p; F(F +1) +s +1) -1(1 +1) 


(F?),., OF + (E-13) 


Since s = J = 1/2, (E-13) is indeed equal to 1/2. 


b. WEAK-FIELD EIGENSTATES AND EIGENVALUES 


According to the results of $a, the matrix which represents 2w,S, in the 
F = 1 level can be written : 


(E-14) 


In the F = 0 level, this matrix reduces to a number, equal to 0. 
Since these two matrices are diagonal, we can immediately find the weak-field 
eigenstates (to zeroeth order in wọ) and the eigenvalues (to first order in w,): 


Eigenstates Eigenvalues 
[F=lim=1> <> a + hog 

|F=lim=0) <> Li 

|F=tim= 1) > HE os (E-15) 
|F =0;m, =0) <> E 49 
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In figure 5, we have plotted hw, on the x-axis and the energies of the four 
Zeeman sublevels on the y-axis (Zeeman diagram). In a zero field, we have the two 
hyperfine levels, F = 1 and F = 0. When the field B, is turned on, the |F =0,m, =0> 
sublevel, which is not degenerate, starts horizontally; as for the F = 1 level, its 
three-fold degeneracy is completely removed: three equidistant sublevels are 
obtained, varying linearly with ha, with slopes of + 1, 0, — 1 respectively. 


E 
Mp 
rl 
F=1 0 
= 1 
N 
e 
» 
F=0 0 FIGURE 5 
The weak-field Zeeman diagram of the ls 
ground state of the hydrogen atom. The 
hyperfine F = 1 level splits into three equi- 
| distant levels, each of which corresponds to 
a well-defined value of the quantum number mp. 
h The F = 0 level does not undergo any shift 
0 mp to first order in w,. 


The preceding treatment is valid as long as the difference hw, between two 
adjacent Zeeman sublevels of the F = | level remains much smaller than the zero- 
field difference between the F = | and F = 0 levels (the hyperfine structure). 


COMMENT: 


The Wigner-Eckart theorem, mentioned above, makes it possible to show that, 
in a given level F of the total angular momentum, the Zeeman Hamiltonian w,(L, + 2S_) 
is represented by a matrix proportional to F,. Thus, we can write, denoting the projector 
onto the F level by Pp: 


Py[@o(L, + 25,))Pe = gp@oPpF Pp (E-16) 


gr is called the Landé factor of the F state. In the case which concerns us here, g-_, = L 


c. THE BOHR FREQUENCIES INVOLVED IN THE EVOLUTION OF <F> 
AND <S>. COMPARISON WITH THE VECTOR MODEL OF THE ATOM 


In this section, we shall determine the different Bohr frequencies which appear in the 
evolution of < F > and < S >, and show that certain aspects of the results obtained recall those 
found by using the vector model of the atom (cf. complement Fy). 
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First of all, we shall briefly review the predictions of the vector model of the atom 
(in which the various angular momenta are treated like classical vectors) as far as the hyperfine 
coupling between I and S is concerned. In a zero field, F = 1 + S is a constant of the motion. 
I and S precess about their resultant F with an angular velocity proportional to the coupling 
constant .4 between I and S. If the system is, in addition, placed in a weak static field B, 
parallel to Oz, onto the rapid precessional motion of I and S about F is superposed a slow 
precessional motion of F about Oz (Larmor precession; fig. 6). 


5w 
—> -> 


FIGURE 6 


The motion of S, I and F in the vector 
model of the atom. S and I precess 
rapidly about F under the effect of the 
hyperfine coupling. In a weak field, 
F slowly precesses about B (Larmor 
precession). 


F, is therefore a constant of the motion, while S, has a static part (the projection onto Oz 
of the component of S parallel to F), and a part which is modulated by the hyperfine precession 
frequency (the projection onto Oz of the component of S perpendicular to F, which precesses 
about F). 

Let us compare these semi-classical results with those of the quantum theory presented 
earlier in this section. To do so, we must consider the time evolution of the mean values < F, > 
and < S, >. According to the discussion of $ D-2-d of chapter HI, the mean value < G >(r) of a 
physical quantity G contains a series of components which oscillate at the various Bohr 
frequencies (E — E'Yh of the system. Also, a given Bohr frequency appears in < G >(1) only 
if the matrix element of G between the states corresponding to the two energies is different 
from zero. In the problem which concerns us here, the eigenstates of the weak-field Hamiltonian 
are the | F, mp > states. Now consider the two matrices (E-10) and (E-11) which represent S, 
and F, in this basis. Since F, has only diagonal matrix elements, no Bohr frequency different 
from zero can appear in < F, X(t) : < F, > is therefore static. On the other hand, S, has, not only 
diagonal matrix elements (with which is associated a static component of < S, >), but also a non- 
diagonal element between the | F = 1, mp = 0) and | F = 0, mp = O > states, whose energy 
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difference is .4%?, according to table (E-15) (or figure 5). It follows that < S, > has, in addition 
to a static component, a component modulated at the angular frequency .4h. This result 
recalls the one obtained using the vector model of the atom*. 


COMMENT: 


A relation can be established between perturbation theory and the vector model 
of the atom. The influence of a weak field By on the F = land F = 0 levels can be obtained 
by retaining in the Zeeman Hamiltonian 2w,5S, only the matrix elements in the F = 1 
and F = 0 levels, “forgetting” the matrix element of S, between | F = 1; mp = 0 > and 
| F = 0;m, = 0 >. Proceeding in this way, we also “forget” the modulated component 
of < S, >, which is proportional to this matrix element. We therefore keep only the 
component of < S > parallel to < F >. 

Now, this is precisely what we do in the vector model of the atom when we want 
to evaluate the interaction energy with the field By. In a weak field, F does precess 
about By much more slowly than S does about F. The interaction of By with the 
component of S perpendicular to F therefore has no effect, on the average; only the 
projection of S onto F counts. This is how, for example, the Lande factor is calculated. 


3. The strong-field Zeeman effect 


We must now start by diagonalizing the Zeeman term. 


a. EIGENSTATES AND EIGENVALUES OF THE ZEEMAN TERM 


This term is diagonal in the { | ms, m; > } basis: 


2098, 


ms. mM, > = 2mshowy | ms, m; > (E-17) 


Since ms = + 1/2, the eigenvalues are equal to + hw,. Each of them is therefore 
two-fold degenerate, because of the two possible values of m,. We therefore have**: 


+ ha | +,+)> 
—ho,|-,+) 


oe |+,+)> (E-18) 


2095, | —-,+> 


* A parallel could also be established between the evolution of < F, >, < S, >, < F, >. < S, >, and 
that of the projections of the vectors F and S of figure 6 onto Ox and Oy. However, the motion of < F > 
and < S> does not coincide perfectly with that of the classical angular momenta. In particular, the 
modulus of <S > is not necessarily constant (in quantum mechanics, < S? > # < S >*): see discussion 
of complement Fy. 

** To simplify the notation, we shall often write | eg, e, > instead of | ms, m, >, where es and e, 
are equal to + or —, depending on the signs of my and m,. 
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b. EFFECTS OF THE HYPERFINE TERM CONSIDERED AS A PERTURBATION 


The corrections to first order in .4 can be obtained by diagonalizing the 
restrictions of the operator £ I. S to the two subspaces {| +, + > }and{|—,+)} 
corresponding to the two different eigenvalues of 2w,S,. 

First of all, notice that, in each of these two subspaces, the two basis vectors 


| +, + >and | +, — (or | —, + > and | —, — >)are also eigenvectors of F,, but 
do not correspond to the same value of mp = ms + m; Since the operator 
4 Y.S = z (F? — I? — S?) commutes with F,, it has no matrix elements between 
the two states |+, +>and| +, — >, or | —, + >and | —, — >. The two matrices 
representing .Y4 I. S in the two subspaces {| +, + >} and [| —, + > } are then 


diagonal, and their eigenvalues are simply the diagonal elements : 


<ms; m | 1.8 |m; mY, 
which can also be written, using the relation: 


I.S = LS, +58. +1_S,) (E-19) 


in the form: 


< ms, m; | 1.S | ms, m» 
= (ms, M; | AIS, | mz, m,> = AW mgm, (E-20) 


Finally, in a strong field, the eigenstates (to zeroeth order in .4) and the eigen- 
values (to first order in .7) are: 


Eigenstates Eigenvalues 


2 
| +, +> <> hog HE 


Ah? 
4 


| +,- >) <> hay — 


an (E-21) 


4 
An? 
4 


|=,=> <> — ho, + 


In figure 7, the solid-line curves on the right-hand side (for ha, > Lh?) represent 
the strong-field energy levels: we obtain two parallel straight lines of slope + 1, 
separated by an energy .4h?/2, and two parallel straight lines of slope — 1, also 
separated by .4h?/2. The perturbation treatments presented in this section and the 
preceding one therefore give the strong-field asymptotes and the tangents at the 
origin of the energy levels. 


1240 


www.elsolucionario.net 
E. THE ZEEMAN EFFECT OF THE HYPERFINE STRUCTURE 


Es Ej FIGURE 7 
The strong-field Zeeman diagram of the Is 
++ ground state of the hydrogen atom. For each 
orientation of the electronic spin (£s = + or 
€s = —), we obtain two parallel straight lines 


A separated by an energy .Zh?/2, each one 
z corresponding to a different orientation of 
Y the proton spin (e, = + or e, = —). 
rs 
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COMMENT: 


The strong-field splitting /h?/2 of the two states, | +, + > and |+,-=> 
or | —, + > and | —, — >, can be interpreted in the following way. We have 
seen that only the term 1,S, of expression (E-19) for I.S is involved in a strong 
field, when the hyperfine coupling is treated like a perturbation of the Zeeman 
term. The total strong-field Hamiltonian (E-8) can therefore be written: 


2095, + WIS, = 2( "+ z 1.Ja (E-22) 


39 


It is as if the electronic spin “saw”, in addition to the external field Bo, 
a smaller “internal field”, arising from the hyperfine coupling between I 
and S and having two possible values, depending on whether the nuclear spin 
points up or down. This field adds to or substracts from B, and is responsible 
for the energy difference between | +, + >and | +, — > or between | —, + > 
and | —, — >. 
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c. THE BOHR FREQUENCIES INVOLVED IN THE EVOLUTION OF <S> 


In a strong field, the Zeeman coupling of S with B, is more important than the hyperfine 
coupling of S with I. If we start by neglecting this hyperfine coupling, the vector model of the 
atom predicts that S will precess (very rapidly since |B,| is large) about the Oz direction of By 
(I remains motionless, since we have assumed w, to be negligible). 


w 
© 
> --------» 


FIGURE 8 


The motion of S in the vector model of the atom. In 
a strong field, S precesses rapidly about B, (here we are 
neglecting both the Zeeman coupling between I and B, 
and the hyperfine coupling between I and S, so that I 
remains motionless). 


Expression (E-19) for the hyperfine coupling remains valid for classical vectors, Because 
of the very rapid precession of S, the terms S, and S_ oscillate very fast and have. on the 
average, no effect, so that only the term /,S, counts. The effect of the hyperfine coupling is 
therefore to add a small field parallel to Oz and proportional to /_ (cf. comment of the preceding 
section), which accelerates or slows down the precession of S about Oz, depending on the 
sign of /,. The vector model of the atom thus predicts that S, will be static in a strong field. 

We shall show that quantum theory gives an analogous result for the mean value < S,> 
of the observable S,. In a strong field, the well-defined energy states are, as we have seen, 
the states | my, m, >. Now, in this basis, the operator S, has only diagonal matrix elements. 
No non-zero Bohr frequency can therefore appear in < S, >, which, consequently, is a static 
quantity*, unlike its weak-field counterpart (cf. $ E-2-c). 


* The study of < S, > and < S, > presents no difficulty. We find two Bohr angular frequencies : 
one, (0, + .Zh/2, slightly larger than wo, and the other one, wy — .Wh/2, slightly smaller. They correspond 
to the two possible orientations of the “internal field”, produced by /,, which adds to the external field Bo. 

Similarly, we find that I precesses about the “internal field” produced by S.. 
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4. The intermediate-field Zeeman effect 


a. THE MATRIX WHICH REPRESENTS THE TOTAL PERTURBATION 
IN THE (|F,m,>) BASIS 


The | F, mp > states are eigenstates of the operator .4 1. S. The matrix which 
represents this operator in the { | F, mp > } basis is therefore diagonal. The diagonal 
elements corresponding to F = | are equal to .4%?/4, and those corresponding to 
F = 0, to — 3.4 h?/4. Furthermore, we have already written, in (E-10), the matrix 
representation of $, in the same basis. It is now very simple to write the matrix which 
represents the total perturbation (E-8). Arranging the basis vectors in the order 


|1, 1>, |1, —1),|1,0>, | 0, 0 >, we thus obtain: 
2 
mm + hag 0 | 0 0 
A 7 A | ex 
0 a 0 0 
— i ae (E-23) 
0 i 0 | 4 hao 
j i 3h? 
0 0 hw, -3 
COMMENT: 


S, and F, commute; 2w,S, can therefore have non-zero matrix elements 
only between two states with the same m,. Thus, we could have predicted all 
the zeros of matrix (E-23). 


b. ENERGY VALUES IN AN ARBITRARY FIELD 


Matrix (E-23) can be broken into two 1 x 1 matrices and one 2 x 2 matrix. 
The two 1 x 1 matrices immediately yield two eigenvalues : 


2 
Ei = zm + hoo 
Pe (E-24) 
E, = y~ ho, 


corresponding respectively to the state | 1, 1 > (that is, the state | +, + >) and to 
the state |1, — 1 > (that is, the state | —, — >). In figure 9, the two straight lines of 
slopes + 1 and — | passing through the point whose ordinate is + .4h?/4 for a 
zero field (for which the perturbation theory treatment gave only the initial and 
asymptotic behavior) therefore represent, for any B,, two of the Zeeman sublevels. 
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— 34 h?/4 
(F = 0) 


FIGURE 9 


The Zeeman diagram (for an arbitrary field) of the ls ground state of the hydrogen atom : 
m, remains a good quantum number for any value of the field. We obtain two straight lines, of 
opposite slopes, corresponding to the values + 1 and — 1 of mp, as well as a hyperbola whose two 
branches are associated with the two m, = 0 levels. Figures 5 and 7 give, respectively, the tangents 
at the origin and the asymptotes of the levels shown in this diagram. 


The eigenvalue equation of the remaining 2 x 2 matrix can be written: 


(+ - e- 2i — E) -Ro =0 (E-25) 
The two roots of this equation can easily be found: 

pa (5) +10 (E-26) 

Ea (SF) + Rw (E-27) 
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When hw, varies, the two points of abscissas ha, and ordinates E, and E, follow 
the two branches of a hyperbola (fig. 9). The asymptotes of this hyperbola are the 
two straight lines whose equation is E = — (.4h?/4) + ho, obtained in $3 above. 
The two turning points of the hyperbola have abscissas of œ = 0 and ordinates of 
— (Lh?/4) + Ah? /2, that is, .4h?/4 and — 3.4h?/4. The tangents at both these 
points are horizontal. This is in agreement with the results of §2 for the states 
|F =1;m, =0>and|F=0;m,=0). 

The preceding results are summarized in figure 9, which is the Zeeman 
diagram of the 1s ground state. 


c. PARTIAL HYPERFINE DECOUPLING 


In a weak field, the well-defined energy states are the states | F, mp y; in 
a strong field, the states | my, m, >; in an intermediate field, the eigenstates of 
matrix (E-23), which are intermediate between the states | F, mp > and the states 
| ms, my >. 

One thus moves continuously from a strong coupling between I and S (coupled 
bases) to a total decoupling (uncoupled bases) via a partial coupling. 


COMMENT: 


An analogous phenomenon exists for the Zeeman fine structure effect. 
If, for simplicity, we neglect W,,, we know ($ C) that, in a zero field, the 
eigenstates of the Hamiltonian H are the | J, m, > states corresponding to 
a strong coupling between L and S (the spin-orbit coupling). This property 
remains valid as long as W, < W,. If, on the other hand, B, is strong enough 
to make W, > W,, we find that the eigenstates of H are the | m,, mg > states 
corresponding to a total decoupling of L and S. The intermediate zone 
(W, = W,) corresponds to a partial coupling of L and S. See, for example, 
complement Dx, in which we study the Zeeman effect of the 2p level (without 
taking W,, into account). 


References and suggestions for further reading: 


The hydrogen atom spectrum: Series (11.7), Bethe and Salpeter (11.10). 

The Dirac equation : the subsection “ Relativistic quantum mechanics” of section 2 
of the bibliography and Messiah (1.17), chap. XX, especially §§ V and IV-27. 

The fine structure of the n = 2 level and the Lamb shift: Lamb and Retherford 
(3.11); Frisch (3.13); Series (11.7), chaps. VI, VII and VIII. 

The hyperfine structure of the ground state: Crampton et al (3.12). 

The Zeeman effect and the vector model of the atom: Cagnac and Pebay- 
Peyroula (11.2), chap. XVII, §§3E and 4C; Born (11.4), chap. 6, §2. 

Interstellar hydrogen: Roberts (11.17); Encrenaz (12.11), chap. IV. 
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COMPLEMENTS OF CHAPTER XII 


Axn: THE MAGNETIC HYPERFINE HAMILTONIAN 


Bn: CALCULATION OF THE MEAN VALUES 
OF THE FINE-STRUCTURE HAMILTONIAN 
IN THE 1s, 2s AND 2p STATES 


Cyn: THE HYPERFINE STRUCTURE 
AND THE ZEEMAN EFFECT 
FOR MUONIUM AND POSITRONIUM 


Dxu: THE INFLUENCE OF THE ELECTRON SPIN 
ON THE ZEEMAN EFFECT 
OF THE HYDROGEN RESONANCE LINE 


Axu: derivation of the expression for the hyper- 
fine Hamiltonian used in chapter XII. Gives the 
physical interpretation of the various terms 
appearing in this Hamiltonian — in particular, 
the contact term. Rather difficult. 


Bxi: the detailed calculation of certain radial 
integrals appearing in the expressions obtained 
in chapter XII for the energy shifts. Not concep- 
tually difficult. 


Cxp: extension of the study of $8 D and E of 
chapter XII to two important hydrogen-like 
systems, muonium and positronium, already pre- 
sented in complement Ay. Brief description of 
experimental methods for studying these two 
systems. Simple if the calculations of $8 D and E 
of chapter XTI have been well understood. 


Dip: study of the effect of the electronic spin 
on the frequencies and polarizations of the 
Zeeman components of the resonance line of 
hydrogen. Improves the results obtained in 
complement Dy; in which the electron spin was 
ignored (uses certain results of that complement). 
Moderately difficult. 


Exu: THE STARK EFFECT 
FOR THE HYDROGEN ATOM 


Exu: study of the effect of a static electric field 
on the ground state (n = 1) and the first excited 
state (n = 2) of the hydrogen atom (the Stark 
effect). Shows the importance for the Stark effect 
of the existence of a degeneracy between two 
states of different parities. Rather simple. 
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Complement Axu 


THE MAGNETIC HYPERFINE HAMILTONIAN 


1. Interaction of the electron with the scalar and vector potentials created by the proton 


2. Detailed form of the hyperfine Hamiltonian 
a. Coupling of the magnetic moment of the proton with the orbital angular momentum of the electron 
b. Coupling with the electron spin 
a. The magnetic field associated with the proton 
Bb. The magnetic dipole term 
y. The contact term 


3. Conclusion: the hyperfine-structure Hamiltonian 


The aim of this complement is to justify the expression for the hyperfine Hamiltonian 
given in chapter XII [relation (B-20)]. As in that chapter, we shall confine our 
reasoning to the hydrogen atom, which is composed of a single electron and a proton, 
although most of the ideas remain valid for any atom. We have already said that 
the origin of the hyperfine Hamiltonian is the coupling between the electron and the 
electromagnetic field created by the proton. We shall therefore call A,(r) and U;(r) 
the vector and scalar potentials associated with this electromagnetic field. We shall 
begin by considering the Hamiltonian of an electron subjected to these potentials. 


1. Interaction of the electron with the scalar 
and vector potentials created by the proton 


Let R and P be the position and momentum of the electron, S, its spin: m., its 
mass; and q, its charge; ug = qh/2m, is the Bohr magneton. 
The Hamiltonian H of the electron in the field of the proton can be written: 


H = -L [P — gA(R)]}*’+ qU(R) — 2u(3) Vx A(R) (1) 


This operator is obtained by adding the coupling energy between the magnetic 
moment 2u,S/h associated with the spin and the magnetic field V x Aj(r) to 
expression (B-46) of chapter III (the Hamiltonian of a spinless particle). 

We shall begin by studying the terms which, in (1), arise from the scalar 
potential U;(r). According to complement Ey, this potential results from the super- 
position of several contributions, each of them associated with one of the electric 
multipole moments of the nucleus. For an arbitrary nucleus, we must consider: 
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(i) The total charge — Zq of the nucleus (the moment of order k = 0), which 
yields a potential energy : 


2 
ee (2) 
Anéor 


(with, for the proton, Z = 1). Now, the Hamiltonian which we chose in chapter VII 
for the study of the hydrogen atom is precisely: 


57 + Vo(R) (3) 


V,(R) has therefore already been taken into account in the principal Hamiltonian Ho. 


(ii) The electric quadrupole moment (k = 2) of the nucleus. The corresponding 
potential adds to the potential Y, and yields a term of the hyperfine Hamiltonian, 
called the electric quadrupole term. The results of complement E, enable us to write 
this term without difficulty. In the case of the hydrogen atom, it is zero, since the 
proton, which is a spin 1/2 particle, has no electric quadrupole moment (cf. §2-c-« 
of complement Ex). 

(iii) The electric multipole moments of order k = 4, 6, etc... which are theoret- 
ically involved as long as k < 2/; for the proton, they are all zero. 

Thus, for the hydrogen atom, potential (2) is really the potential seen by the 
electron*. There is no need to add any corrections to it (by hydrogen atom, we 
mean the electron-proton system, excluding isotopes such as deuterium : since the 
deuterium nucleus has a spin J = 1, we would have to take into account an electric 
quadrupole hyperfine Hamiltonian — see comment (i) at the end of this complement). 

Now let us consider the terms arising from the vector potential A,(r) in (1). 
We denote by M, the magnetic dipole moment of the proton (which, for the same 
reason as above, cannot have magnetic multipole moments of order k > 1). We 
have: 


M, x r 
Alr) = LALA (4) 
4n r 


The hyperfine Hamiltonian W,, can now be obtained by retaining in (1) the terms 
which are linear in A;: 


Wy = — ai [P. AR) + AYR). P] — 2u(F)- V x A,(R) (5) 


and by replacing A, by expression (4) (since Wj, already makes a very small 
correction to the energy levels of Ho, it is perfectly legitimate to ignore the second- 
order term, in A?). This is what we shall do in the following section. 


* We are concerned here only with the potential outside the nucleus, where the multipole moment 
expansion is possible. Inside the nucleus, we know that the potential does not have form (2). This causes 
a shift in the atomic levels called the “volume effect”. This effect was studied in complement Dyp and 
we shall not take it into account here. 
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2. The detailed form of the hyperfine Hamiltonian 


a. COUPLING OF THE MAGNETIC MOMENT OF THE PROTON 
WITH THE ORBITAL ANGULAR MOMENTUM OF THE ELECTRON 


First of all, we shall calculate the first term of (5). Using (4), we have: 


PAW) + A(R) -P = $2 [P (My) o.) (6) 


We can apply the rules for a mixed vector product to vector operators as long as 
we do not change the order of two non-commuting operators. The components 
of M, commute with R and P, so we have: 


(M, x R).P=(RxP).M,=L.M, (7) 


where: 
L=RxP (8) 


is the orbital angular momentum of the electron. It can easily be shown that: 


pS =0 (9) 


(any function of |R| is a scalar operator), so that: 


4M, x R). P = eM (10) 
Similarly : 

P (M, x R) = - M;.(P x RT = (11) 
since: 

—~PxR=L (12) 
Thus, the first term of (5) makes a contribution Wt, to W,, which is equal to: 

Wi = de LO MEE Le MEA (13) 


This term corresponds to the coupling between the nuclear magnetic moment M, 
and the magnetic field : 


B = Ho IL 
* dama? 


created by the current loop associated with the rotation of the electron (cf. fig. 1). 
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M, 


FIGURE | 


Relative disposition of the magnetic moment M, 
of the proton and the field B, created by the 
current loop associated with the motion of the 
electron of charge y and velocity y (B, is anti- 
parallel to the orbital angular momentum L of 
the electron). 


COMMENT: 


The presence of the 1/R* term in (13) might lead us to believe that there is 
a singularity at the origin, and that certain matrix elements of W{, are infinite. Actually, 
this is not the case. Consider the matrix element < Pr 1m | Wks | Prr mo D> Where | Ox 1 m > 
and | Oy: im > are the stationary states of the hydrogen atom found in chapter VII. 
In the { | r > } representation, we have: 


<r | Prtim > = Primli) = Ri dr) YO. p) (14) 


with [cf. chap. VII, relation (A-28)]: 
R, lr) ao r' (15) 


With the presence of the r? dr term in the integration volume element taken into account, 
the function to be integrated over r behaves at the origin like r'*?273 = ¿+21 Further- 
more, the presence of the Hermitian operator L in (13) means that the matrix element 


< Patan | Why | Peet > 


is zero when / or /' is zero. We then have / + l > 2, and r'*!’—! remains finite at the 
origin. 


b. COUPLING WITH THE ELECTRON SPIN 


We shall see that, for the last term of (5), the problems related to the singularity 
at the origin of the vector potential (4) are important. This is why, in studying this 
term, we shall choose a proton of finite size, letting its radius approach zero at the 
end of the calculations. Furthermore, from a physical point of view, we now know 
that the proton does possess a certain spatial extension and that its “magnetism” is 
spread over a certain volume. However, the dimensions of the proton are much 
smaller than the Bohr radius ay. This justifies our treating the proton as a point 
particle in the final stage of the calculation. 


a. The magnetic field associated with the proton 


Consider the proton to be a particle of radius pọ (fig. 2), placed at the origin. 
The distribution of magnetism inside the proton creates, at a distant point, a field B 
which can be calculated by attributing to the proton a magnetic moment M, which 
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we shall choose parallel to Oz. For r > p>, we obtain the components of B from 
the curl of the vector potential written in (4): 


B, = 43m % 
4n r5 
Ho yz 
B, = — 3M 16 
» dn" (16) 
2. 2 
Baliy 
An y? 


FIGURE 2 


The magnetic field created by the proton. Outside the proton, the field is that of a dipole. Inside, 
the field depends on the exact distribution of the magnetism of the proton, but we can, in a first 
approximation, consider it to be uniform. The contact term corresponds to the interaction between 
the spin magnetic moment of the electron and this uniform field B, inside the proton. 


Expressions (16), moreover, remain valid even if r is not very large compared to Po 
We have already emphasized that the proton, since it is a spin 1/2 particle, has no 
magnetic multipole moments of order k > 1. The field outside the proton is there- 


fore a pure dipole field. 
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Inside the proton, the magnetic field depends on the exact magnetic distribution. 
We shall assume this field B, to be uniform (by symmetry, it must then be 
parallel to M, and, therefore, to Oz)*. 

To calculate the field B, inside the proton, we shall write the equation stating 
that the flux of the magnetic field through a closed surface, bounded by the xOy plane 
and the upper hemisphere centered at O and of infinite radius, is zero. Since, 
asr — 00, [B| decreases as 1/r?, the flux through this hemisphere is zero. Therefore, 
if D (pọ) denotes the flux through a disk centered at O of radius pọ in the xOy plane, 
and ®,(po), the flux through the rest of the xOy plane, we have: 


®( Po) + ®.(Po) = 0 (17) 


Relations (16) enable us to calculate ®,(p,) easily, and we get: 


As for the flux 9,(p,) of B,, it is equal to: 


P(Po) = TpoB, (19) 


so that (17) and (18) yield: 


B, = m2 (20) 
4n po 


Thus, we know the values of the field created by the proton at all points in space. 
We can now calculate the part of W, related to the electron spin S. 


B. The magnetic dipole term 


If we substitute (16) into the term — 2y, (5) . V x A,, we obtain the operator: 


h 
dip_ _ Ho 24M, XS, + YS, + ZS, S, 
Wis? = An i fz RS = R3 (21) 


that is, taking into account the fact that M, is, by hypothesis, parallel to Oz: 


ip _ Ho 2up 1 (S . RIM, .R) 
wai = Lo 2482 jg M, - 32 AP 22 
M mà rat i R? (22) 


* The following argument can be generalized to cases where B, varies in a more complicated 
fashion (see comment (ii) at the end of this complement). 
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This is the expression for the Hamiltonian of the dipole-dipole interaction between 
two magnetic moments M, and M; = 2y,S/h (cf. complement By} $1). 


Actually, expression (16) for the magnetic field created by the proton is valid 
only for r > p,, and (22) should be applied only to the part of the wave functions 
which satisfies this condition. However, when we let pọ approach zero, expres- 
sion (22) gives no singularity at the origin; it is therefore valid in all space. 


Consider the matrix element: 


È Primel WEE | Ovr POD 


(we are adding here the indices e and e' to the states | , ,,,, » considered above in order to label 
the eigenvalues £h/2 and e'h/2 of S,) and, in particular, the radial integral which corresponds to 
it. At the origin, the function of r to be integrated behaves like r!*'"*273 = /1+"=1. Now, 
according to condition (8-c) of complement B,,, the non-zero matrix elements are obtained for 
1 + l > 2. There is therefore no divergence at the origin. In the limit where pọ ——> 0, the 
integral over r becomes an integral from 0 to infinity, and expression (22) is valid in all space. 


y. The contact term 


We shall now substitute (20) into the last term of (5), so as to obtain the 
contribution of the internal field of the proton to W,,. We then obtain an ope- 
rator W;,, which we shall call the “contact term”, and whose matrix elements in 
the {|kim > ) representation are: 


< Pklme | Wis | Preme? 
2 
= -fe pM E Ie |S, |e mall dir PË lr) Py 1 rl) (23) 
rs<po 


Let p, approach zero. The integration volume, 4np3/3, also approaches zero, and 
the right-hand side of (23) becomes: 


2u4gM ,. ST 
— Fete < E| SiE > Pim =0) Prr wlt =0) (24) 


The contact term is therefore given by: 


pee a o (5) (R) (25) 


Therefore, although the volume containing an internal magnetic field (20) approaches 
zero when pọ —> 0, the value of W;, remains finite, since this internal field 
approaches infinity as 1/p;. 


COMMENTS: 
(i) In (25), the function 6(R) of the operator R is simply the projector: 
6(R) =|r=0><r=0| (26) 
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(ii) The matrix element written in (23) is different from zero only if / = /' = 0. 
This is a necessary condition for €, ,,,(r =0) and Ppr mlr = 0) to be 
non-zero (cf. chap. VII, §C-4-c-B). The contact term therefore exists only 
for the s states. 


(iii) In order to study, in § 2-a, the coupling between M, and the orbital angular momentum 
of the electron, we assumed expression (4) for A,(r) to be valid in all space. This amounts 
to ignoring the fact that the field B actually has the form (20) inside the proton. We might 
wonder if this procedure is correct, or if there is not also an orbital contact term in Wy; 

Actually, this is not the case. The term in P . A, + A,. P would lead, for the 
field B,, to an operator proportional to: 


2 
B.L=% mL, (27) 
4n p 


Let us calculate the matrix element of such an operator in the { | p, , m > } representation. 
The presence of the operator L, requires, as above, /, /' > 1. The radial function to be 
integrated between 0 and po then behaves at the origin like r!*'*? and therefore goes 
to zero at least as rapidly as r*. Despite the presence of the 1/p¿ term in (27), the integral 
between r = 0 and r = pọ therefore goes to zero in the limit where py ——> 0. 


3. Conclusion: the hyperfine-structure Hamiltonian 


Now, let us take the sum of the operators W;,, wee and W}. We use the 
fact that the magnetic dipole moment M, of the proton is proportional to its angular 
momentum I: 


I 
M, = 20) (28) 
(cf. $B-2-a of chapter XII). We obtain: 
Ho 2UgH,9» | 1. L (I.R\(S.R) I.S 8z e 
W,=- “222 — + 3 2 _ - — +— 1.8 0(R 29 
a 4n K R? R5 RÈ 3 R Me) 


This operator acts both in the state space of the electron and in the state space of 
the proton. It can be seen that this is indeed the operator introduced in chapter XII 


[ef. (B-20)]. 


COMMENTS: 


(i) We will now discuss the generalization of formula (29) to the case of an 
atom having a nuclear spin J > 1/2. 
First of all, if Z = 1, we have already seen that the nucleus can have an electric 
quadrupole moment which adds a contribution to the potential V (r) given 
by (2). An electric quadrupole hyperfine term is therefore present in the 
hyperfine Hamiltonian, in addition to the magnetic dipole term (29). Since 
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an electrical interaction does not directly affect the electron spin, this quadru- 
pole term only acts on the orbital variables of the electrons. 

If now / > 1, other nuclear electric or magnetic multipole moments can 
exist, increasing in number as / increases. The electric moments give rise to 
hyperfine terms acting only on the orbital electron variables, while the 
magnetic terms act on both the orbital and the spin variables. For elevated 
values of /, the hyperfine Hamiltonian has therefore a complex structure. In 
practice however, for the great majority of cases, one can limit the hyperfine 
Hamiltonian to magnetic dipole and electric quadrupole terms. This is due 
to the fact that the multipole nuclear moments of an order superior to 2 make 
extremely small contributions to the hyperfine atomic structures. These 
contributions are therefore difficult to observe experimentally. This arises 
essentially from the extremely small size of the nuclei compared to the spatial 
extent a, of the electronic wave functions. 


The simplifying hypothesis which we have made concerning the field B(r) created by 
the proton (a perfectly uniform field within a sphere, a dipole field outside) is not 
essential. The form (25) of the magnetic dipole Hamiltonian remains valid whenever 
the nuclear magnetism has an arbitrary repartition, giving rise to more complicated 
internal fields B,(r) (assuming however that the spatial extent of the nucleus is negligible 
compared to dy; cf. the following comment). The argument is actually a direct 
generalization of that given in this complement. Consider a sphere S, centered at the 
origin, containing the nucleus and having a radius £ < dp. 

If Z = 1/2, the field outside S, has the form (16) and, since £ is very small 
compared to dp, its contribution leads to the terms (13) and (22). As for the contribution 
of the field B(r) inside S,, it depends only on the value at the origin of the electronic 
wave functions and on the integral of B(r) inside S,. Since the flux of B(r) across all 
closed surfaces is zero, the integral in S, of each component of B(r) can be transformed 
into an integral outside of S,, where B(r) has the form (16). A simple calculation will 
again give exactly expression (25) which is therefore independent of the simplifying 
hypothesis that we have made. 

If Z > 1/2, the nuclear contribution to the electromagnetic field outside of S, 
gives rise to the multipole hyperfine Hamiltonian which we have discussed in comment (i) 
above. On the other hand, one can easily show that the contribution of the field 
inside S, does not give rise to any new term : only the magnetic dipole possesses a contact 
term. 


In all of the above, we have totally neglected the dimensions of the nucleus compared 
to those of the electronic wave functions (we have taken the limit py/@y) ——= 0). This is 
obviously not always realistic, in particular for heavy atoms whose nuclei have 
a relatively large spatial extension. If one studies these “volume effects” (keeping for 
example several of the lower order terms in pp/ap), a series of new terms appears in the 
electron-nucleus interaction Hamiltonian. We have already encountered this type of effect 
in complement D,, where we studied the effects of the radial distribution of the nuclear 
charge (nuclear multipole moments of order k = 0). Analogous phenomena occur 
concerning the spatial distribution of nuclear magnetism and lead to modifications of 
different terms of the hyperfine Hamiltonian (29). In particular, a new term must be 
added to the contact term (25) when the electronic wave functions vary significantly 
within the nucleus. This new term is neither simply proportional to 6(R), nor to the total 
magnetic moment of the nucleus. It depends on the spatial distribution of the nuclear 
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magnetism. From a practical point of view, such a term is interesting since, using 
precise measurements of the hyperfine structure of heavy atoms, it permits obtaining 
information concerning the variation of the magnetism within the corresponding nuclei. 


References and suggestions for further reading: 


The hyperfine Hamiltonian including the electric quadrupole interaction term: 
Abragam (14.1), chap. VI; Kuhn (11.1), chap. VI, $ B; Sobel’man (11.12), chap. 6. 
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Complement Byun 


CALCULATION OF THE MEAN VALUES 
OF THE FINE-STRUCTURE HAMILTONIAN 
IN THE 1s, 2s AND 2p STATES 


1. Calculation of < 1/R >, < 1/R? > and < 1/R? > 

2. The mean values < Wae > 

3. The mean values < Wp > 

4. Calculation of the coefficient ¢,, associated with W,, in the 2p level 


For the hydrogen atom, the fine-structure Hamiltonian +V, is the sum of three terms: 


Wr = Wao + Woo + Wo (1) 


studied in $ B-1 of chapter XII. 

The aim of this complement is to give the calculation of the mean values of 
these three operators for the 1s, 2s and 2p states of the hydrogen atom, a calculation 
which was omitted in chapter XII for the sake of simplicity. We shall begin by 
calculating the mean values of < 1/R >, < 1/R? > and < 1/R? > in these states. 


1. Calculation of <1/R>», <1/R?> and <1/R? > 


The wave function associated with a stationary state of the hydrogen atom is 
(cf. chap. VII, 8 C): 


On,1.mlt) = Ro, 1 (1) YT(8, p) (2) 


Y (0, p) is a spherical harmonic. The expressions for the radial functions R,, ¡(r) 
corresponding to the ls, 2s, 2p states are: 


Ry or) = Aa) 3 e7" 


R;,(r) = 22aq)-*°(1 _ x) e77/2ao A 
240 
R>,1(r) NN (2a) 3/2(3)7 1/2 +. e7r/2a0 
0 
where dy is the Bohr radius: 
2 2 
ag = 4n€, h = h 5 a 


ma? me 
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The Y7 are normalized with respect to O and g, so that the mean value < R? > of the 
qth power (where q is a positive or negative integer) of the operator R associated 
with r = |r| in the state | p, ,,, > can be written*: 


< R° > alm = | ee [Ro dr (5) 
0 


It therefore does not depend on m. If (3) is substituted into (5), there appear integrals 
of the form: 


I(k, p) = | rk e7 pr'a dy (6) 
0 


where p and k are integers. We shall assume here that k > 0, that is, q > — 2. 
An integration by parts then yields directly : 


I(k, p) = | — LL e= yk + Ho | p1 go priao dy 
p P |, 


0 


= —2 i(k — 1, p) (7) 


Since, furthermore: 


100, p) = | sg mdp = 5 (8) 
0 
we obtain, by recurrence: 
a k+1 
Hk, ») = (32) 0) 


Now, let us apply this result to the mean values to be determined. We obtain: 
4 j — 2r/a 
C1/R>i, =3| re e dr 
0 


I(1, 2) = — (10-a) 


4 f° r |? 
I/R 5a; = — r| 1 — — | e7” dr 
ió 845 j l A 


1 1 l 
= fua, 1) - — 1(2, 1) a e J 


Ag ag 


= (10-b) 


* Of course, this mean value exists only for values of q which make integral (5) convergent. 
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a 
A e) PO (10-c) 
2403 ” 4ao 
Similarly : 
4 2 
CIR? >a, =£ 100,2) =£ (11-a) 
do ao 
curo, = — | no, 1) -+ 10,1) +E 1, J aa (11-b) 
= we ao 4a? 4a? 
l 1 
IJR? 5, =—1(Q, 1) = 11-c 
< / 22p 24a5 ( ) 12a3 ( ) 


It is clear that the expression for the mean value of 1/R? is meaningless for the 1s 
and 2s states [since integral (5) is divergent]. For the 2p state, it is equal to: 


I 1 
1/R? = — 1(1, 1) = 12 
< / ar 24a8 ( ) 24a3 ( ) 
2. The mean values < Wm > 
Let: 
P? 
H, = om, +V (13) 
be the Hamiltonian of the electron subjected to the Coulomb potential. 
We have: 
P* = 4m?[H, — VI? (14-a) 
with: 
2 
e 
V = — R (14-b) 
so that: 
-= -— [M - V} (15) 


o E 
2m,c 


We shall take the mean values of both sides of this expression in a 
state | P, ¡ m ». Since Ho and V are Hermitian operators, we obtain: 
l 


2m,c 


< Wo adam iam. [(E, y + 2E; e? < 1/R Pha + et < 1/R? nt (16) 


,2 
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In this expression, we have set: 


EEN = EREE (17) 


a == (18) 


is the fine-structure constant. 
If we apply relation (16) to the case of the 1s state, we obtain, using (10-a) 


and (11-a): 
2 4 
LA ! > È atm?ct — o2m,c? — + 25] (19) 
2m,c” | 4 ao as 


that is, since, according to (4) and (18), e?/aọ = a?m,c?: 


C Mine Das = 7 pene] E | = q am (20) 


The same type of calculation, for the 2s state, leads to: 


2 
O TEC = -L ame 
{ Wav as = 5 A Mec (5) 2 3 + | = — 75g "eC (21) 
and, for the 2p state, to: 
in acl, A) -211 a A 
< Wav Dap = = 7 me (5) -251 +2 = — ggg mc (22) 
3. The mean values < W, > 
With (14-b) and the fact that 4(1/r) = — 4mó(r) taken into account, the 
mean value of W, in the | 9, ,,, > state can be written [see also formula (B-14) 
of chapter XII]: 
h? 2 2 
(Wo atm = Ane” l6,,, lr = 0) (23) 


2,2 
8m¿c 


This expression goes to zero if y, , , (r = 0) = 0, that is, if / 4 0. Therefore: 


< Wp >a = 9 (24-a) 

For the ls and 2s levels, we obtain, using (2), (23) and the fact that Y} = 1/ Van : 
h? 1 
Wp Xis = —— €? |R; o0)? = — a*tm,c? E 

< Wp) mia |R; ,o(0)| Ti (24-b) 
as well as: 

< Wp Xas = d e? |R, 0(0)? = L gime? (24-c) 

D725 8m2c? i 16 “ 
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Calculation of the coefficient ¿,, associated with Ws, 
in the 2p level 


In $ C-2-c-B of chapter XII, we defined the coefficient : 


a f Reat ar (25) 


2,2 
2m¿c 


According to (3): 


Bc iA) (26) 
?P  2m2c? 24a” 


Relation (9) then yields: 


ee E ae a 
P 2m2c? 24a? 48h? 


References: 


Several radial integrals for hydrogen-like atoms are given in Bethe and Salpeter (11.10). 
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Complement Cx 


THE HYPERFINE STRUCTURE AND THE ZEEMAN EFFECT 
FOR MUONIUM AND POSITRONIUM 


1. The hyperfine structure of the Is ground state 
2. The Zeeman effect in the 1s ground state 


a. The Zeeman Hamiltonian 

b. Stationary state energies 

c. The Zeeman diagram for muonium 
d. The Zeeman diagram for positronium 


In complement Ayp, we studied some hydrogen-like systems, composed, like the 
hydrogen atom, of two oppositely charged particles electrostatically attracted to 
each other. Of all these systems, two are particularly interesting : muonium 
(composed of an electron, e”, and a positive muon, 4” ) and positronium (composed 
of an electron, e”, and a positron, e* ). Their importance lies in the fact that the 
various particles which come into play (the electron, the positron and the muon) 
are not directly affected by strong interactions (while the proton is). The theoretical 
and experimental study of muonium and positronium therefore permits a very 
direct test of the validity of quantum electrodynamics. 

Actually, the most precise information we now possess about these two 
systems comes from the study of the hyperfine structure of their ls ground state 
[the optical lines joining the 1s ground state to the various excited states have 
just recently been observed for positronium; cf. Ref. (11.25)]. This hyperfine 
structure is the result, as in the case of the hydrogen atom, of magnetic interactions 
between the spins of the two particles. We shall describe some interesting features 
of the hyperfine structure and the Zeeman effect for muonium and positronium 
in this complement. 


1. The hyperfine structure of the 1s ground state 


Let S, be the electron spin and S,, the spin of the other particle (the muon 
or the positron, which are both spin 1/2 particles). The degeneracy of the Is ground 
state is then, as for hydrogen, four-fold. 

We can use stationary perturbation theory to study the effect on the Is 
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ground state of the magnetic interactions between S, and S,. The calculation is 
analogous to the one in $D of chapter XII. We are left with a problem of two 
spin 1/2’s coupled by an interaction of the form: 


AS, . S, (1) 


where . is a constant which depends on the system under study. We shall denote 
by Ay, Y y, Ap the three values of . which correspond respectively to hydrogen, 
muonium and positronium. 

It is easy to see that:. 


Gy < Ay < Ap (2) 


since the smaller the mass of particle (2), the larger its magnetic moment. Now the 
positron is about 200 times lighter than the muon, which is close to 10 times lighter 
than the proton. 


COMMENT: 


The theory of chapter XII is insufficient for the extremely precise study of the 
hyperfine structure of hydrogen, muonium and positronium. In particular, the hyperfine 
Hamiltonian W; given in $B-2 of this chapter describes only part of the interactions 
between particles (1) and (2). For example, the fact that the electron and the positron 
are antiparticles of each other (they can annihilate to produce photons) is responsible for 
an additional coupling between them which has no equivalent for hydrogen and muonium. 
In addition, a series of corrections (relativistic, radiative, recoil effects, etc.) must be taken 
into account. These are complicated to calculate and must be treated by quantum 
electrodynamics. Finally, for hydrogen, nuclear corrections are also involved which are 
related to the structure and polarizability of the proton. However, it can be shown that 
the form (1) of the coupling between S, and S, remains valid, the constant being given 
by an expression which is much more complicated than formula (D-8) of chapter XII. 
The hydrogen-like systems studied in this complement are important precisely because 
they enable us to compare the theoretical value of . with experimental results. 


The eigenstates of «/S, . S, are the | F, mp > states, where F and mp are the 
quantum numbers related to the total angular momentum: 


F=S, +8, (3) 


As in the case of the hydrogen atom, F can take on two values, F = 1 and F = 0. 
The two levels, F = 1 and F = 0, have energies equal to .4%?/4 and — 3.4h?/4, 
respectively. Their separation /h* gives the hyperfine structure of the ls ground 
state. Expressed in MHz, this interval is equal to: 


h 
Sq Lu = 4463.317 + 0.021 MHz (4) 


for muonium, and: 


h 
zz Le = 203403 + 12 MHz (5) 


for positronium. 
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2. The Zeeman effect in the 1s ground state 


a. THE ZEEMAN HAMILTONIAN 


If we apply a static field B, parallel to Oz, we must add, to the hyperfine 
Hamiltonian (1), the Zeeman Hamiltonian which describes the coupling of B, to 
the magnetic moments: 


M, =y18, (6) 
and: 

M, = 728, (7) 
of the two spins, with gyromagnetic ratios y, and y,. If we set: 

WO, = — Bo (8) 

w, = — YaBo (9) 


this Zeeman Hamiltonian can be written : 
DSi: + 0,8), (10) 
In the case of hydrogen, the magnetic moment of the proton is much smaller 
than that of the electron. We used this property in $ E-1 of chapter XII to neglect 
the Zeeman coupling of the proton, compared to that of the electron*. Such an 
approximation is less justified for muonium, since the magnetic moment of the 
muon is larger than that of the proton. We shall therefore take both terms of (10) 


into account. For positronium, furthermore, they are equally important: the 
electron and positron have equal masses and opposite charges, so that: 


MA (11) 
or: 


0, = —0, (12) 


b. STATIONARY STATE ENERGIES 

When B, is not zero, it is necessary, in order to find the stationary state 
energies, to diagonalize the matrix representing the total Hamiltonian: 

AS .S, + 0,S,, + 0,8», (13) 
in an arbitrary orthonormal basis, for example, the { | F, mp > } basis. A calculation 


which is analogous to the one in $E-4 of chapter XII then leads to the following 
matrix (the four basis vectors are arranged in the order 


(|, 1), |, —1)>,[1,0>,/0,0>)): (14) 


* Recall that the gyromagnetic ratio of the electron spin is y, = 2u,/h (Hp : the Bohr magneton). 
Thus, if we set œ = — ugBo/h (the Larmor angular frequency), the constant œw, defined by (8) is equal 
to 2w, (this is, furthermore, the notation used in $E of chapter XII; to obtain the results of that section, 
it therefore suffices, in this complement, to replace w, by 2, and w, by 0). 
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(14) 


Matrix (14) can be broken down into two 1 x 1 submatrices and a 2 x 2 
submatrix. Two eigenvalues are therefore obvious: 


_ fh? th 


E, = a +50 + w,) (15) 
2 
E, = Fw, +0) (16) 


They correspond, respectively, to the states | 1, 1 > and | 1, — 1 >, which, moreover, 
coincide with the states | +, + > and | —, — > of the { |e,, e, > } basis of common 
eigenstates of S,, and S,,. The other two eigenvalues can be obtained by diagonalizing 
the remaining 2 x 2 submatrix. They are equal to: 


fh? — Nan R? : 
Ey = -+ Fal 5 ) + (0, — 2) (17) 
Bh? AN m2 p 
E=- - lew +5 (o, - 0) (18) 
In a weak field, they correspond to the states | 1, 0 > and | 0, 0 >, respectively, and, 
in a strong field, to the states | +, — >and | —, + >. 
c. THE ZEEMAN DIAGRAM FOR MUONIUM 


The only differences with the results of $ E-4 of chapter XII arise from the 
fact that here, we are taking the Zeeman coupling of particle (2) into account. 
These differences appear only in a sufficiently strong field. 

Let us therefore consider the form taken on by the energies E, and E, 
when fi(w, — w,) > oh?. In this case: 


Ah? h 

E, = as +5(0, — W) (19) 
ah À 

E, = - a 3 (0, — 0)) (20) 


Now, compare (19) with (15) and (20) with (16). We see that, in a strong field, 
the energy levels are no longer represented by pairs of parallel lines, as was the 
case in $ E-3 of chapter XII. The slopes of the asymptotes of the E, and £, levels 
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f h 
are, respectively, — Eo, + y2) and — 50 — Y); those of the £, and E, levels, 


h h , 
3 (y, + y,)and 5 (y, — Y2). Since the two particles (1) and(2) have opposite charges, 


y, and y, have opposite signs. Consequently, in a sufficiently strong field, 
the E, level (which then corresponds to the | +. — > state) moves above the E, level 


R l h l 
(the | +, + > state), since its slope, — 50 — y») is greater than — 7 (y, + 72). 


The distance between the E, and E, levels therefore varies in the following way with 
respect to By (cf. fig. 1): starting from 0, it increases to a maximum for the value of By which 
makes the derivative of: 


Ah? 


E, — E; = > + F(B,) (21) 


equal to zero, with: 


f 


h i 
F(Bo) = —5 (Y, + ¥2)Bo — i 


2 —— (y, — 2)" (22) 


4 


a ¿PB 
2 


The distance then goes to zero again, and finally increases without bound. As for the distance 
between the E, and E, levels, it starts with the value .4h?, decreases to a minimum for the 
value of B, which makes the derivative of: 


Hh? 
2 


E,- E, = = F(Bo) (23) 


equal to zero and then increases without bound. 

Since it is the same function f(B,) that appears in (21) and (23), we can show that, for 
the same value of B, [the one which makes the derivative of f(B,) go to zero], the distances 
between the E, and E, levels and between the E, and E, levels are either maximal or 
minimal. This property was recently used to improve the accuracy of experimental determi- 
nations of the hyperfine structure of muonium. 

By stopping polarized muons (for example, in the | + > state) in a rare gas target, one 
can prepare, in a strong field, muonium atoms which will be found preferentially in the | +, + > 
and | —, + > states. If we then apply simultaneously two radiofrequency fields whose frequencies 
are close to (E, — E,)/h and (E, — E,)/h, we induce resonant transitions from | +, + > to 
| +, — > and from | —, + > to | —, — > (arrows in figure 1). It is these transitions which are 
detected experimentally, since they correspond to a flip of the muon spin which is revealed by 
a change in the anisotropy of the positrons emitted during the B-decay of the muons. If we are 
operating in a field B, such that the derivative of f (Bo) is zero, the inhomogeneities of the static 
field, which may exist from one point to another of the cell containing the rare gas, are not 
troublesome, since the resonant frequencies of muonium, (E, — £3)/h and (E, — E,)/h, are 
not affected, to first order, by a variation of By [ref. (11.24)]. 


COMMENT: 


For the ground state of the hydrogen atom, we obtain a Zeeman 
diagram analogous to the one in figure | when we take into account the 
Zeeman coupling between the proton spin and the field Bo. 


1266 


www.elsolucionario.net 


MUONIUM AND POSITRONIUM : HYPERFINE STRUCTURE, ZEEMAN EFFECT 


E A 


N 
N 
N 
N 
N 
o 
J N ¿En 
a? \ 
N 
N 
N 
N 
N 
N 
O 
Bo 
0 _—_—>H 


FIGURE ! 


The Zeeman diagram for the ls ground state of muonium. Since we are not neglecting here the 
Zeeman coupling between the magnetic moment of the muon and the static field B,, the two straight 
lines (which correspond, in a Strong field, to the same electron spin orientation but different muon 
spin orientations) are no longer parallel, as was the case for hydrogen (in the Zeeman diagram of 
figure 9 of chapter XII, the Larmor angular frequency œ, of the proton was neglected). 
For the same value of the static field B,, the splitting between the £, and E), levels is maximal and 
that between the £, and £, levels is minimal. The arrows represent the transitions studied experi- 
mentally for this value of the field B,. 
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d. THE ZEEMAN DIAGRAM FOR POSITRONIUM 


If we set w, = — w, (this property is a direct consequence of the fact that 
the positron is the antiparticle of the electron) in (15) and (16), we see that the E, 
and E, levels are independent of By: 


bh? (24) 


E =E= 4 


On the other hand, we obtain from (17) and (18): 


Ah? AN an 
Ah? ARNT 
E, = -% - ft > ) + h?y?B2 (26) 


The Zeeman diagram for positronium therefore has the form shown in figure 2. 
It is composed of two superposed straight lines parallel to the B, axis and one 
hyperbola. 


FIGURE 2 


The Zeeman diagram for the ls ground 
state of positronium. As in the cases of 
hydrogen and muonium, this diagram is 
composed of one hyperbola and two straight 
lines. However, since the gyromagnetic 
ratios of the electron and positron are 
eqnal and opposite, the two straight lines 
have a zero slope and, consequently, are 
superposed (in the two corresponding 
states, with energy E, and E,, the total 
magnetic moment is zero, since the electron 
and positron spins are parallel). The arrow 
represents the experimentally studied 
transition. 
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Actually, positronium is not stable. It decays by emitting photons. In a zero field, it can 
be shown by symmetry considerations that the F = 0 state (the singlet spin state, or “para- 
positronium ”) decays by emitting two photons. Its half-life is of the order of 7, = 1.25 x 107 *%sec. 
On the other hand, the F = | state (the triplet spin state, or “orthopositronium”) can decay 
only by emitting three photons (since the two-photon transition is forbidden). This process is 
much less probable, and the half-life of the triplet is much longer, on the order of 
tT, = 1.4 x 1077 sec. 

When a static field is applied, the E, and E, levels retain the same lifetimes since the 
corresponding eigenstates do not depend on By. On the other hand, the | 1, 0 > state is “mixed” 
with the | 0, 0 > state, and vice versa. Calculations analogous to those of complement Hy show 
that the lifetime of the E, level is reduced relative to its zero-field value 7, (that of the E, level 
is increased relative to the value 7,). The positronium atoms in the E, state then have a certain 
probability of decaying by emission of two photons. 

This inequality of the lifetimes of the three states of energies E,, E,, Ez when Bọ is 
non-zero is the basis of the methods for determining the hyperfine structure of positronium. 
Formation of positronium atoms by positron capture by an electron generally populates the four 
states of energies E,, E,, E,, E, equally. In a non-zero field By, the two states E, and E, decay 
less rapidly than the E, state, so that in the stationary state, they are more populated. If we 
then apply a radiofrequency field oscillating at the frequency 


(E; E E, )/h = (E; = E,)/h, 


we induce resonant transitions from the £, and £, states to the £, state (the arrow of figure 2). 
This increases the decay rate via two-photon emission, which permits the detection of resonance 
when (with fixed Bọ) we vary the frequency of the oscillating field. Determination of E, — E, 
for a given value of B, then allows us to find the constant .? by using (24) and (25). 

In a zero field, resonant transitions could also be induced between the unequally 
populated F = 1 and F = 0 levels. However, the corresponding resonant frequency, given 
by (5), is high and not easily produced experimentally. This is why one generally prefers to use 
the “low frequency” transition represented by the arrow of figure 2. 


References and suggestions for further reading: 


See the subsection “Exotic atoms” of section 11 of the bibliography. 
The annihilation of positronium is discussed in Feynman IH (1.2), $18-3. 
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Complement Dxn 


THE INFLUENCE OF THE ELECTRONIC SPIN 
ON THE ZEEMAN EFFECT OF THE HYDROGEN RESONANCE LINE 


1. Introduction 

2. The Zeeman diagrams of the Is and 2s levels 
3. The Zeeman diagram of the 2p level 

4. The Zeeman effect of the resonance line 


a. Statement of the problem 
b. The weak-field Zeeman components 7 
c. The strong-field Zeeman components. The Paschen-Back effect 


1. Introduction 


The conclusions of complement Dy; relative to the Zeeman effect for the 
resonance line of the hydrogen atom spectrum (the ls <= 2p transition) must be 
modified to take into account the electron spin and the associated magnetic 
interactions. This is what we shall do in this complement, using the results obtained 
in chapter XII. 

To simplify the discussion, we shall neglect effects related to nuclear spin 
(which are much smaller than those related to the electron spin). Therefore, we 
shall not take the hyperfine coupling W, (chap. XII, $ B-2) into account, choosing 
the Hamiltonian H in the form : 


H, is the electrostatic Hamiltonian studied in chapter VII ($ C), W,, the sum of the 
fine structure terms (chap. XII, §B-1): 


We = Way + W + Wo (2) 


mp 


and W,, the Zeeman Hamiltonian (chap. XII, §E-1) describing the interaction 
of the atom with a magnetic field B, parallel to Oz: 


Wz = D(L, + 2S,) (3) 


where the Larmor angular frequency wọ is given by: 


pea 


e 


Bo (4) 


[we shall neglect œw, relative to w; see formula (E-4) of chapter XI]. 
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We shall determine the eigenvalues and eigenvectors of H by using a method 
analogous to that of $E of chapter XH : we shall treat W, and W; like pertur- 
bations of Hy. Although they have the same unperturbed energy, the 2s and 2p levels 
can be studied separately since they are connected neither by W, (chap. XII, 
§C-2-a-B) nor by W,. In this complement, the magnetic field B, will be called 
weak or strong, depending on whether W, is small or large compared to W,. Note 
that the magnetic fields considered here to be “weak” are those for which W; 
is small compared to W, but large compared to W,,, which we have neglected. 
These “weak fields” are therefore much stronger than those treated in § E of 
chapter XII. 

Once the eigenstates and eigenvalues of H have been obtained, it is possible 
to study the evolution of the mean values of the three components of the electric 
dipole moment of the atom. Since an analogous calculation was performed in 
detail in complement D,,, we shall not repeat it. We shall merely indicate, for 
weak fields and for strong fields, the frequencies and polarization states of the 
various Zeeman components of the resonance line of hydrogen (the Lyman « line). 


2. The Zeeman diagrams of the 1s and 2s levels 


We saw in $ D-1-b of chapter XII that W, shifts the Is level as a whole and 
gives rise to only one fine-structure level, 1s,,,. The same is true for the 2s level, 
which becomes 2s,,,. In each of these two levels, we can choose a basis: 


1 1 
fin1= 0m = 0m = gim 25> ) (5) 


of eigenvectors common to H,, L?, L,, S,, Z, (the notation is identical to that of 
chapter XII; since H does not act on the proton spin, we shall ignore m, in all 
that follows). 

The vectors (5) are obviously eigenvectors of W, with eigenvalues 2m5ho,. 
Thus, each ls, ,, or 2s,,, level splits, in a field Bọ, into two Zeeman sublevels of 
energies : 


E(n; l = 0: m, = 0; mg) = Elus, ,,) + 2mshw, (6) 


where E(ns,,) is the zero-field energy of the ns,,, level, calculated in $$ C-2-b 
and D-1-b of chapter XII. The Zeeman diagram of the 1s, ,, level (as well as the one 
for the 2s,,, level) is therefore composed of two straight lines of slopes + 1 
and — | (fig. 1), corresponding, respectively, to the two possible orientations of 
the spin relative to B¿(ms, = + 1/2 and ms = — 1/2). 

Comparison of figure | and figure 9 of chapter XII shows that to neglect, 
as we are doing here, the effects related to nuclear spin amounts to considering 
fields B, which are so large that W, > W,,. We are then in the asymptotic region 
of the diagram of figure 9 of chapter XII, where we can ignore the splitting of the 
energy levels due to the proton spin and hyperfine coupling. 
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E 


E(ls 1/2) 
FIGURE 1 


The Zeeman diagram of the 1s,,, level when 
the hyperfine coupling $Y;, is neglected. The 
ordinate of the point at which the two levels 
ms = + 1/2 cross is the energy of the 1s,,, 
level (i.e, the eigenvalue — E, of H,, corrected 
for the global shift produced by the fine- 
structure Hamiltonian W,). Figure 9 of chap- 
ter XII gives an idea of the modifications of 
this diagram produced by W,,. 


The Zeeman diagram of the 2p level 

In the six-dimensional 2p subspace, we can choose one of the two bases: 

(|1=2;1=1;m,;m5>)) (7) 
or: 

(|n=2;1=1;J;m,>) (8) 


adapted, respectively, to the individual angular momenta L and S and to the total 
angular momentum J = L + S [cf. (36-a) and (36-b) of complement A, ]. 

The terms W,,,,and Wp which appear in expression (2) for W, shift the 2p level 
as a whole. Therefore, to study the Zeeman diagram of the 2p level, we simply 
diagonalize the 6 x: 6 matrix which represents Wi + W; in either one of the two 
bases, (7) or (8). Actually, since W; and Wey = €,,L.S both commute with 
J, = L, + S,, this 6 x 6 matrix can be broken down into as many submatrices 
as there are distinct values of m,. Thus, there appear two one-dimensional 


submatrices (corresponding respectively to m, = + 3/2 and m; = — 3/2) and two 
two-dimensional submatrices (corresponding respectively to m, = + 1/2 and 
m, = — 1/2). The calculation of the eigenvalues and associated eigenvectors 


(which is very much like that of $ E-4 of chapter XII) presents no difficulties and 
leads to the Zeeman diagram shown in figure 2. This diagram is composed of two 
straight lines and four hyperbolic branches. 
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EA 


E(2p3/2) 


E(2p) 
FIGURE 2 
The Zeeman diagram of the 2p level 
when the hyperfine coupling V,, is 
neglected. In a zero field, we find the 
fine-structure levels, 2p,,, and 2p, ,. 
E(2p 1/2) y a 


The Zeeman diagram is composed of 
two straight lines and two hyperbolas 
(for which the asymptotes are shown 
in dashed lines). The hyperfine cou- 
pling 4;, would significantly modify 
this diagram only in the neighborhood 
of œ = 0. EQp) is the 2p level 
energy (the eigenvalue — E,/4 of Ho) 
corrected for the global shift produced 
a > by Wp + Wp 


In a zero field, the energies depend only on J. We obtain the two fine- 
structure levels, 2p,,, and 2p,,,, already studied in $ C of chapter XII. whose 
energies are equal to: 


Elpa) = ÈP) +5 bap? 0) 


E(2p,)2) = E(2p) — €,,h? (10) 


E(2p) is the 2p level energy E(2p) corrected for the global shift due to W, and W, 
[cf. expressions (C-8) and (C-9) of chapter XII]. é., is the constant which appears 
in the restriction €,,L . S of Wo to the 2p level [ cf. expression (C-13) of chapter XII ]. 

In weak fields (W; < Wzo). the slope of the energy levels can also be obtained 
by treating W; like a perturbation of W,. It is then necessary to diagonalize the 4 x 4 
and 2 x 2 matrices representing W, in the 2p3,, and 2p,,, levels. Calculations 
analogous to those of § E-2 of chapter XII show that these two submatrices are 
respectively proportional to those which represent «ww, in the same subspaces. 
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The proportionality coefficients, called “ Landé factors” (cf. complement Dy, 83), 
are equal, respectively, to*: 


pa) = 5 (11) 
9(2P1)2) = 5 (12) 


In weak fields, each fine-structure level therefore splits into 2J + 1 equidistant 
Zeeman sublevels. The eigenstates are the states of the “coupled” basis, (8), 
corresponding to the eigenvalues: 


E( J, my) = EQp,) + m; g(2py) ho, (13) 


where the E(2p,) are given by expressions (9) and (10). 

In strong fields (W; > Mo), we can, on the other hand, treat Wen = £,,L.S 
like a perturbation of Wz, which is diagonal in basis (7). As in $ E-3-b of chapter XII, 
it can easily be shown that only the diagonal elements of ¿,,L.S are involved 
when the corrections are calculated to first order in W¿¿. Thus, we find that in 
strong fields, the eigenstates are the states of the “decoupled” basis, (7), and the 
corresponding eigenvalues are: 


E(m,, ms) = E(2p) + (m; + 2m5)h0w, + Mims ae (14) 


Formula (14) gives the asymptotes of the diagram of figure 2. 

As the magnetic field B, increases, we pass continuously from basis (8) to 
basis (7). The magnetic field gradually decouples the orbital angular momentum 
and the spin. This situation is the analogue of the one studied in §E of chapter XII, 
in which the angular momenta S and I were coupled or decoupled, depending 
on the relative importance of the hyperfine and Zeeman terms. 


4. The Zeeman effect of the resonance line 


a. STATEMENT OF THE PROBLEM 


Arguments of the same type as those of §2-c of complement D,,,, (see, in 
particular, the comment at the end of that complement) show that the optical 
transition between a 2p Zeeman sublevel and a ls Zeeman sublevel is possible only 
if the matrix element of the electric dipole operator gR between these two states 
is different from zero**. In addition, depending on whether it is the g(X + iY), 
q(X — iY) or gZ operator which has a non-zero matrix element between the two 


* These Landé factors can be calculated directly from formula (43) of complement Dy. 
** The electric dipole, since it is an odd operator, has no matrix elements between the Is and 
2s states, which are both even. This is why we are ignoring the 2s states here. 
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Zeeman sublevels under consideration, the polarization state of the emitted light 
is ot, o or n. Therefore, we use the previously determined eigenvectors and 
eigenvalues of H in order to obtain the frequencies of the various Zeeman 
components of the hydrogen resonance line and their polarization states. 


COMMENT: 


The g(X + iY), g(X — iY) and qZ operators act only on the orbital part of the 
wave function and cause m, to vary, respectively, by + 1, — 1 and 0 (cf. complement Dy yy, 
$ 2-c); mg is not affected. Since m, = m, + mg is a good quantum number (whatever the 
strength of the field By), Am, = + 1 transitions have a o* polarization; dm, = — 1 
transitions, a a” polarization; and 4m, = 0 transitions, a n polarization. 


b. THE WEAK-FIELD ZEEMAN COMPONENTS 


Figure 3 shows the weak-field positions of the various Zeeman sublevels 
resulting from the Is,)., 2p,,, and 2p}, levels, obtained from expressions (6), 
(13), (11) and (12). The vertical arrows indicate the various Zeeman components 
of the resonance line. The polarization is o*, o or n, depending on whether 
Am, = + 1, — lor9. 

Figure 4 shows the positions of these various components on a frequency 
scale, relative to the zero-field positions of the lines. The result differs notably from 
that of complement D,,, (see figure 2 of that complement), where, observing in a 
direction perpendicular to B,, we had three equidistant components of polari- 
zation o*, z, o”, separated by a frequency difference wo/2z. 


> 
4r aft 
~~~ ----+~----+------+-------------------- > 
a 
v 
b 
o R T at oc coco Rn T o* ot 
Wo 
2n 
e > 
FIGURE 4 


Frequencies of the various Zeeman components of the hydrogen resonance line, 

a) In a zero field: two lines are observed, separated by the fine-structure splitting 3¢,,h/4x (¿,, is 
the spin-orbit coupling constant of the 2p level) and corresponding respectively to the transitions 
2P3/2 <> is; (the line on the right-hand side of the figure) and 2p,,, <—> 1s,,, (the line on 
the left-hand side). 

b) In a weak field B,: each line splits into a series of Zeeman components whose polarizations are 
indicated; w,/27 is the Larmor frequency in the field Bp. 
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30 32 12 
3/2 — 1/2 
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2 12 1/2 
E(2p 4/2) t 3 Mo 1/2 = 1/2 
T 
| E 
| 
j 1/2 1/2 
E(1s, 2) | 2hw 
1/2 — 1/2 
FIGURE 3 


The disposition, in a weak field, of the Zeeman sublevels arising from the fine-structure levels, 
Is, /2, 2P1 2, 2P3;2 (whose zero-field energies are marked on the vertical energy scale). On the right- 
hand side of the figure are indicated the splittings between adjacent Zeeman sublevels (for greater 
clarity, these splittings have been exaggerated with respect to the fine-structure splitting which 
separates the 2p,,, and 2p,,, levels), as well as the values of the quantum numbers J and m, asso- 
ciated with each sublevel. The arrows indicate the Zeeman components of the resonance line, each 
of which has a well-defined polarization, o*, o or z. 
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1 h? 
—— l5 Dhoo +> Ep 
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(2p) a, 
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2 
a 2hw, + 3 Čap 
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E(1s,,3) 
— hay 
FIGURE 5 


The disposition, in a strong field (decoupled fine structure), of the Zeeman sublevels arising from 
the ls and 2p levels. On the right-hand side of the figure are indicated the values of the quantum 
numbers m, and m, associated with each Zeeman sublevel, as well as the corresponding energy, 
given relative to E(is,,,) or E(2p). The vertical arrows indicate the Zeeman components of the 
resonance line. 
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c. THE STRONG-FIELD ZEEMAN COMPONENTS. THE PASCHEN-BACK EFFECT 


Figure 5 shows the strong-field positions of the Zeeman sublevels arising 
from the 1s and 2p levels [see expressions (6) and (14)]. To first order in We, 
the degeneracy between the states |m, = — l,m = 1/2) and|m, = 1,m; = — 1/2) 
is not removed. The vertical arrows indicate the Zeeman components of the 
resonance line. The polarization is o *, a” or z, depending on whether 4m, = + 1, 
— 1l or 0 (recall that in an electric dipole transition, the quantum number my is not 
affected ). 

The corresponding optical spectrum is shown in figure 6. The two z transitions 
have the same frequency (cf. fig. 5). On the other hand, there is a small splitting, 
hé,,/2n, between the frequencies of the two o* transitions and between those 
of the two a” transitions. The mean distance between the o* doublet and the z line 
(or between the z line and the a” doublet) is equal to w,/22. The spectrum of 
figure 6 is therefore similar to that of figure 2 of complement Dyu. Furthermore, 


Heap ht, 
2r on 
—» ——» 
(o ~ | | O ~ THI o + ot 
&————— L > 
Wy $ _— —_ —————————— > 
> w 
2n ¿0 
2n 
FIGURE 6 


The strong-field positions of the Zeeman components of the hydrogen resonance line. Aside from 
the splitting of the c* and o” lines, this spectrum is identical to the one obtained in complement D,,,,, 
where the effects related to electron spin were ignored. 


the splitting of the c* and c` lines, due to the existence of the electron spin, is 
easy to understand. In strong fields, L and S are decoupled. Since the ls <> 2p 
transition is an electric dipole transition, only the orbital angular momentum L 
of the electron is affected by the optical transition. An argument analogous to the 
one in §E-3-b of chapter XII shows that the magnetic interactions related to the 
spin can be described by an “internal field” which adds to the external field B, 
and whose sign changes, depending on whether the spin points up or down. It is 
this internal field that causes the splitting of the o* and g” lines (the z line is not 
affected, since its quantum number m, is zero). 


References and suggestions for further reading: 


Cagnac and Pebay-Peyroula (11.2), chaps. XI and XVII (especially $5-A of that 
chapter); White (11.5), chap. X; Kuhn (11.1), chap. HI, §F ; Sobel’man (11.12), chap. 8, 
§ 29. 


1278 


www.elsolucionario.net 
STARK EFFECT FOR THE HYDROGEN ATOM 


Complement Ex; 


THE STARK EFFECT FOR THE HYDROGEN ATOM 


1. The Stark effect on the 7 = 1 level 


a. The shift of the ts level is quadratic in € 
b. Polarizability of the \s state 


2. The Stark effect on the » = 2 level 


Consider a hydrogen atom placed in a uniform static electric field 8 parallel to Oz. 
To the Hamiltonian studied in chapter XII must be added the Stark Hamiltonian W, 
which describes the interaction energy of the electric dipole moment gR of the atom 
with the field &. W, can be written: 


Wx=-38.R= — géZ (1) 


Even for the strongest electric fields that can be produced in the laboratory, 
we always have W, < H,. On the other hand, if & is strong enough, W, can have 
the same order of magnitude as W, and W,, or be even larger. To simplify the 
discussion, we shall assume throughout this complement that & is strong enough 
for the effect of W; to be much larger than that of W, or W,,. We shall therefore 
calculate directly, using perturbation theory, the effect of W, on the eigenstates 
of H, found in chapter VII (the next step, which we shall not consider here, would 
consist of evaluating the effect of W,, and then of W,,, on the eigenstates 
of Hy + W). 

Since both Ho and W; do not act on the spin variables, we shall ignore the 
quantum numbers ms and m,. 


1. The Stark effect on the n = 1 level 


a. THE SHIFT OF THE 1s STATE IS QUADRATIC IN & 


According to perturbation theory, the effect of the electric field can be 
obtained to first order by calculating the matrix element: 


— q€<n=1,1=0,m, =0|Z|n=1,1=0,m, =0). 


Since the operator Z is odd, and since the ground state has a well-defined parity 
(it is even), the preceding matrix element is zero. 

There is therefore no effect which is linear in é, and we must go on to the 
next term of the perturbation series: 


(2) 


= 9s? Y Eas 
es i 


n 
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where E, = — E,/n’ is the eigenvalue of H, associated with the eigenstate | n, /, m > 
(cf. chap. VII, §C). The preceding sum is certainly not zero, since there exist 
states | n, /, m > whose parity is opposite to that of | 1, 0, 0 >. We conclude that, 
to lowest order in &, the Stark shift of the 1s ground state is quadratic. Since £, — E, 
is always negative, the ground state is lowered. 


b. POLARIZABILITY OF THE 1s STATE 


We have already mentioned that, for reasons of parity, the mean values of the 
components of the operator gR are zero in the state | 1, 0, 0 > (the unperturbed 
ground state). 

In the presence of an electric field $ parallel to Oz, the ground state is no 
longer | 1, 0, 0 >, but rather (according to the results of $ B-!-b of chapter XT): 


sk Z|11,0,0 
Wo =/1,0,0) — 46 Y Imtm) 2 miZ1L007, (3) 
1 n 


nl 
im 


This shows that the mean value of the electric dipole moment gR in the perturbed 
ground state is, to first order in 6, < Wo | qR | Yo >. Using expression (3) for | Yo >, 
we then obtain: 


{Wo laR| yo > = = ql y 


ni 
tym 
¢ 1,0,0 Rin m><n, Ll m Z|1,0,0> +<1,0,0]Z nt m><n LmiR 1,0,0> 
E, = Ey 


(4) 


Thus, we see that the electric field & causes an “induced” dipole moment to 
appear, proportional to 6. It can easily be shown, by using the spherical harmonic 
orthogonality relation*, that < Wo | qX | Yo > and < Wo | qY | Wo > are zero, and 
that the only non-zero mean value is: 


2 
(Yo laz] wo) =- 288 y Keen Pa a 


nil 

L, 
In other words, the induced dipole moment is parallel to the applied field &. This 
is not surprising, given the spherical symmetry of the Is state. The coefficient of 
proportionality y between the induced dipole moment and the field is called the 
linear electric susceptibility. We see that quantum mechanics permits the calculation 
of this susceptibility for the ls state: 


Aglma |Z | 1, 0,057 
Xis = — 29? 2 K = —E : (6) 
1 n 


n+1 
l,m 


* This relation implies that < 1, 0, 0 | Z |n, l, m> is different from zero only if / = 1, m = 0 
(the argument is the same as the one given for < 2, 1, m | Z | 2, 0, 0) in the beginning of $ 2 below). 
Consequently, in (2), (3), (4), (5), (6), the summation is actually carried out only over n (it includes, 
furthermore, the states of the positive energy continuum). 


1280 


www.elsolucionario.net 
STARK EFFECT FOR THE HYDROGEN ATOM 


2. The Stark effect on the n = 2 level 


The effect of W, on the n =2 level can be obtained to first order by 
diagonalizing the restriction of W, to the subspace spanned by the four states of 
the [|2,0,0,>:]2,1.m>.m = — 1,0, + 1 } basis. 

The | 2, 0, 0 > state is even; the three | 2. 1, m > states are odd. Since W, is 
odd, the matrix element < 2, 0, 0 | W | 2. 0, 0 > and the nine matrix elements 
< 2, 1, m | W; | 2. 1. m > are zero (cf. complement F,,). On the other hand, since 
the | 2, 0, 0, > and | 2. 1. m > states have opposite parities, < 2, 1, m | W | 2,0, 0 > can 
be different from zero. 

Let us show that actually only < 2, 1,0 | Mg | 2, 0, 0 > is non-zero. My is pro- 
portional to Z = R cos 0 and, therefore, to Y ?(0). The angular integral which enters 
into the matrix elements < 2, 1. m | W | 2, 0. 0 > is therefore of the form: 


| Y0) ¥9(Q2) ¥9(2) dQ 


Since Y? is a constant, this integral is proportional to the scalar product of Y? 
and Y” and is therefore different from zero only if m = 0. Moreover, since Y?, 
R),o(r) and R,,(r) are real, the corresponding matrix element of W is real. We shall 
set: 

(2,1,0]M|2,0,0) = 96 (7) 


without concerning ourselves with the exact value of y [which could be calculated 
without difficulty since we know the wave functions @, , o(t) and @ o o(r)]. 

The matrix which represents W in the n = 2 level, therefore, has the following 
form (the basis vectors are arranged in the order | 2, 1, 1 >, | 2,1, — 1 >,|2,1,0), 
| 2.0.0 >): 


oJololo| 
o 
| 0 0 Ey i 9 | (8) 
0 0: 0,7g 
pana 
i 0: 0; 7€| 0 


We can immediately deduce the corrections to first order in & and the eigenstates 
to zeroeth order: 


Eigenstates Corrections 
2,1, 1> es 0 
2 k= 15 PEN 0 
1 
AA <> yE 
2 
(2105-2003); es aay (9) 


Sl 
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COMPLEMENT Ex: 


Thus, we see that the degeneracy of the n = 2 level is partially removed and that 
the energy shifts are /inear, and not quadratic, in &. The appearance of a linear 
Stark effect is a typical result of the existence of two levels of opposite parities 
and the same energy, here the 2s and 2p levels. This situation exists only in the 
case of hydrogen (because of the /-fold degeneracy of the n # | shells). 


COMMENT: 


The states of the n = 2 level are not stable. Nevertheless, the lifetime 
of the 2s state is considerably longer than that of the 2p states, since the atom 
passes easily from 2p to ls by spontaneous emission of a Lyman a photon 
(lifetime of the order of 107° sec), while decay from the 2s state requires the 
emission of two photons (lifetime of the order of a second). For this reason, 
the 2p states are said to be unstable and the 2s state, metastable. 

Since the Stark Hamiltonian W; has a non-zero matrix element between 
2s and 2p, any electric field (static or oscillating) “mixes” the metastable 2s 
state with the unstable 2p state, greatly reducing the 2s state’s lifetime. This 
phenomenon is called “ metastability quenching” (see also complement Hyy, 
in which we study the effect of a coupling between two states of different 
lifetimes). 


References and suggestions for further reading: 
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The Stark effect in atoms: Kuhn (11.1), chap. III, §§A-6 and G. Ruark and 
Urey (11.9), chap. V, §§12 and 13; Sobel’man (11.12), chap. 8, §28. 

The summation over the intermediate states which appears in (2) and (6) can be 
calculated exactly by the method of Dalgarno and Lewis; see Borowitz (1.7), §14-5; 
Schiff (1.18), $33. Original references: (2.34), (2.35), (2.36). 

Quenching of metastability: see Lamb and Retherford (3.11), App. 11; Sobel'man 
(11.12), chap. 8, $ 28.5. 
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A. STATEMENT OF THE PROBLEM 


Consider a physical system with Hamiltonian H,. The eigenvalues and eigenvectors 
of Hy will be denoted by E, and | p, >: 


Ho | o, >= E, |, > (A-1) 


For the sake of simplicity, we shall consider H, to be discrete and non-degenerate; 
the formulas obtained can easily be generalized (see, for example, §C-3). We 
assume that H, is not explicitly time-dependent, so that its eigenstates are stationary 
States. 

At ¢ = 0, a perturbation is applied to the system. Its Hamiltonian then 
becomes: 


H(t) = H, + W(t) (A-2) 


with: 
W(t) = AW(t) (A-3) 


where 4 is a real dimensionless parameter much smaller than | and W(t) is an 
observable (which can be explicitly time-dependent) of the same order of magnitude 
as H, and zero for 1 < 0. 

The system is assumed to be initially in the stationary state |; >, an 
eigenstate of Ho of eigenvalue E,. Starting at t = 0 when the perturbation is 
applied, the system evolves: the state | y; > is no longer, in general, an eigenstate 
of the perturbed Hamiltonian. We propose, in this chapter, to calculate the pro- 
bability A,(¢) of finding the system in another eigenstate | py > of Ho at time /. 
In other words, we want to study the transitions which can be induced by the 
perturbation W (t) between the stationary states of the unperturbed system. 

The treatment is very simple. Between the times 0 and ż, the system evolves 
in accordance with the Schródinger equation: 


mE W(t) > = [Ho +0) | > (A-4) 


The solution | W(t) > of this first-order differential equation which corresponds 
to the initial condition: 


| u(t = 0)> = | @,> (A-5) 
is unique. The desired probability 4,(1) can be written: 
P(t) = Ko | W(t) >? (A-6) 
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The whole problem therefore consists of finding the solution | y(r) > of (A-4) 
which corresponds to the initial condition (A-5). However, such a problem is not 
generally rigorously soluble. This is why we resort to approximation methods. 
We shall show in this chapter how, if 4 is sufficiently small, the solution | (1) > 
can be found in the form of a limited power series expansion in 4. Thus, we shall 
calculate |y(/)> explicitly to first order in 2, as well as the corresponding 
probability ($ B). The general formulas obtained will then be applied ($ C) to the 
study of an important special case, the one in which the perturbation is a sinusoidal 
function of time or a constant (the interaction of an atom with an electro- 
magnetic wave, which falls into this category, is treated in detail in comple- 
ment Ax). This is an example of the resonance phenomenon. Two situations 
will be considered : the one in which the spectrum of H, is discrete, and then the 
one in which the initial state | œ; > is coupled to a continuum of final states. In the 
latter case, we shall prove an important formula known as * Fermi's golden rule”. 


COMMENT: 


The situation treated in $C-3 of chapter IV can be considered to be a 
special case of the general problem discussed in this chapter. Recall that, 
in chapter IV. we discussed a two-level system (the states | p, > and | p, >). 
initially in the state | p, >, subjected, beginning at time ¢ = 0, to a constant 
perturbation W. The probability 4 ,(t) can then be calculated exactly, leading 
to Rabi's formula. 

The problem we are taking up here is much more general. We shall 
consider a system with an arbitrary number of levels (sometimes, as in $C-3, 
with a continuum of states) and a perturbation W(t) which is an arbitrary 
function of the time. This explains why, in general, we can obtain only an 
approximate solution. 


B. APPROXIMATE SOLUTION 
OF THE SCHRODINGER EQUATION 


1. The Schrödinger equation in the { | 9, > | representation 


The probability 4,(1) explicitly involves the eigenstates |g; > and | 9, > 
of Ho. It is therefore reasonable to choose the | | o, > | representation. 


a. THE SYSTEM OF DIFFERENTIAL EQUATIONS FOR THE COMPONENTS 
OF THE STATE VECTOR 


Let c,(/) be the components of the ket | w(t) > in the {| ~, > ; basis: 


| Wit) > = Y eb | on > (B-1) 


n 
with: 


ot) = <p, | WO > (B-2) 
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V(t) denotes the matrix elements of the observable W(t) in the same basis : 

< Pn | W(t) | or > = Wali) (B-3) 
Recall that H, is represented in the { | p, > } basis by a diagonal matrix: 

< Pn | Ho | Pk) = En One (B-4) 


We shall project both sides of Schrödinger equation (A-4) onto | y, >. To do so, 
we shall insert the closure relation: 


Llacas (B-5) 


and use relations (B-2), (B-3) and (B-4). We obtain: 
d 
¡Mo lt) = En elt) + lO ct) (B-6) 


The set of equations (B-6), written for the various values of n, constitutes a system 
of coupled linear differential equations of first order in t, which enables us, in 
theory, to determine the components c,(+) of | w(t) >. The coupling between these 
equations arises solely from the existence of the perturbation ÀW (t), which, 
by its non-diagonal matrix elements AW, W(t), relates the evolution of c,(/) to that of 
all the other coefficients c,(t). 


b. CHANGING FUNCTIONS 


When AW(t) is zero, equations (B-6) are no longer coupled, and their 
solution is very simple. It can be written : 


c,(t) = b, een (B-7) 


where b, is a constant which depends on the initial conditions. 

If, now, AW (t) is not zero, while remaining much smaller than H, because 
of the condition 4 < 1, we expect the solution c,(t) of equations (B-6) to be very 
close to solution (B-7). In other words, if we perform the change of functions: 


Cyt) = b, (1) em (B-8) 


we can predict that the b,(+) will be slowly varying functions of time. 
We substitute (B-8) into equation (B-6); we obtain: 


ih ar d z Bale) ) + E, b „(t) e —iEnt/h 


n 


= E, b(t) e ER + Y ZW, bt) e7 E (B-9) 
k 


+HE, ih 


We now multiply both sides of this relation by e , and introduce the Bohr 


angular frequency : 


E, — Es 


z (B-10) 


Onk = 
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related to the pair of states E, and E,. We obtain: 


ih 2 b (t) = AY lene W(t) bt) (B-11) 
k 
2. Perturbation equations 


The system of equations (B-11) is rigorously equivalent to Schródinger 
equation (A-4). In general, we do not know how to find its exact solution. This is 
why we shall use the fact that 4 is much smaller than 1 to try to determine this 
solution in the form of a power series expansion in 4 (which we can hope to be 
rapidly convergent if 4 is sufficiently small): 


b(t) = b(t) + ADD + ADD +... (B-12) 


If we substitute this expansion into (B-11), and if we set equal the coefficients 
of 4” on both sides of the equation, we find: 
(i) forr = 0: 
inp) = 0 (B-13) 
dr ” 


since the right-hand side of (B-11) has a common factor 4. Relation (B-13) 
expresses the fact that b/% does not depend on t. Thus, if 4 is zero, b,(t) reduces to 
a constant [cf. (B-7)]. 


(ii) forr 40: 
mo A A O) (B-14) 
k> 


Thus we see that, with the zeroeth-order solution determined by (B-13) and the 
initial conditions, recurrence relation (B-14) enables us to obtain the first-order 
solution. It then furnishes the second-order solution in terms of the first-order one, 
and, by recurrence, the solution to any order r in terms of the one to order r — 1. 


3. Solution to first order in A 


a. THE STATE OF THE SYSTEM AT TIME t 


For 1 < 0, the system is assumed to be in the state | g; >. Of all the coeffi- 
cients b,(t). only 5,(t) is different from zero (and, furthermore, independent of z 


since AW is then zero). At time 1 = 0, AW (t) may become discontinuous in passing 
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from a zero value to the value AW (0). However, since AW (t) remains finite, the 
solution of the Schrödinger equation is continuous at ¢ = 0. It follows that: 


b,(t = 0) = ô; (B-15) 


and this relation is valid for all 4. Consequently, the coefficients of expansion (B-12) 
must satisfy : 


BO = 0) = ôni (B-16) 

b(t = 0) = 0 if rèl (B-17) 

Equation (B-13) then immediately yields, for all positive 1: 

bDO(t) = ôni (B-18) 
which completely determines the zeroeth-order solution. 

This result now permits us to write (B-14), for r = 1, in the form: 


in-i B10) = D eie Walt) dy, 
dt A 


= eo W(t) (B-19) 


an equation which can be integrated without difficulty. Taking into account initial 
condition (B-17), we find: 


t 


be) = = | eien Y, (1) de (B-20) 
l 
0 


If we now substitute (B-18) and (B-20) into (B-8) and then into (B-1), we 
obtain the state | W(t) > of the system at time z, calculated to first order in 4. 
b. THE TRANSITION PROBABILITY Y, -(t) 


According to (A-6) and definition (B-2) of c,(¢), the transition probability 4 (1) 
is equal to |c,(t)|?, that is, since b,(t) and c,(t) have the same modulus [cf. (B-8)]: 


2 (1) = |b, OF (B-21) 


where: 
b(t) = DOE) RADO +. (B-22) 


can be calculated from the formulas established in the preceding section. 
From now on, we shall assume that the states | o; > and | y > are different. 


We shall therefore be concerned only with the transitions induced by ¿W(t) 
between two distinct stationary states of Hy. We then have b(t) = 0, and, 
consequently: 


P(t) = Pp)? (B-23) 
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Using (B-20) and replacing AW (t) by W(t) [cf. (A-3)], we finally obtain: 


2 


pt 


| elon W (t) de (B-24) 
0 


Consider the function Wilt’), which is zero for t’ < 0 and £' > t, and equal 
to W,¡() for 0 < f' < t (cf. fig. 1). W(t’) is the matrix element of the perturbation 
“seen” by the system between the time ¢ = 0 and the measurement time ¢, when 
we try to determine if the system is in the state | g, >. Result (B-24) shows that Z,(t) 
is proportional to the square of the modulus of the Fourier transform of the 
perturbation actually “seen”, w, (t'). This Fourier transform is evaluated at an 
angular frequency equal to the Bohr angular frequency associated with the transition 
under consideration. 


FIGURE 1 


The variation of the function Walt) with 
respect to 1’. Walt) coincides with W,,(t’) in 
the interval 0 < ¢’ < ¢, and goes to zero 
outside this interval. It is the Fourier transform 


of W,(t') that enters into the transition pro- 
0 t t’ bability 2, ,(1) to lowest order. 


Note also that the transition probability 4,(t) is zero to first order if the 
matrix element W,,(t) is zero for all z. 


COMMENT: 


We have not discussed the validity conditions of the approximation 
to first order in A. Comparison of (B-11) with (B-19) shows that this 
approximation simply amounts to replacing, on the right-hand side of (B-11), 
the coefficients b,(t) by their values b,(0) at time 1 = 0. It is therefore clear 
that, so long as £ remains small enough for b,(0) not to differ very much 
from 6,(t), the approximation remains valid. On the other hand, when 1 
becomes large, there is no reason why the corrections of order 2, 3, etc. in A 
should be negligible. 
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C. AN IMPORTANT SPECIAL CASE: 
A SINUSOIDAL OR CONSTANT PERTURBATION 


1. Application of the general equations 


Now assume that W(t) has one of the two simple forms: 


W(t) = W sin of (C-1-a) 
Wit) = Wcos wt (C-1-b) 


where W is a time-independent observable and w, a constant angular frequency. 
Such a situation is often encountered in physics. For example, in complements A, y, 
and By, We consider the perturbation of a physical system by an electromagnetic 
wave of angular frequency œw; &,(t) then represents the probability, induced by the 
incident monochromatic radiation, of a transition between the initial state | @; > 
and the final state | p, >. ` F 

With the particular form (C-1-a) of W (t), the matrix elements W,;(t) take on 
the form: 


A A 


z Wo iw — to! 
W(t) = Wp sin ot = re t — e~et) (C-2) 


where W, is a time-independent complex number. Let us now calculate the state 
vector of the system to first order in 4. If we substitute (C-2) into general 
formula (B-20), we obtain : 
W t 
b(t) a cal [often toy a eer) dt’ (C-3) 
0 


The integral which appears on the right-hand side of this relation can easily be 
calculated and yields: 


WwW. [ 1 — enito 1 — eloni oN 
z| Ou FO 0-0 ] (E) 


Therefore, in the special case we are treating, general equation (B-24) becomes: 


W.12]] — ellos tor 1 — eller 0) 
P(t; w) = 7? bO = [Wal LSe e 


PE l (C-5-a) 


wr +O Dri — O 


(we have added the variable œw in the probability Z,, since the latter depends on the 
frequency of the perturbation). 

If we choose the special form (C-1-b) for W(t) instead of (C-1-a), a calculation 
analogous to the preceding one yields: 


IW? Í 
P, t; œ) = a 
ip ) h2 


las elos; + o) L= elos: - w)t 


(C-5-b) 
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W cos wt becomes time-independent if we choose w = 0. The transition proba- 
bility Z(t) induced by a constant perturbation W can therefore be obtained by 
replacing œw by 0 in (C-5-b): 


2 
P kt) = Wel? |1 = elos |2 


hot, 
Ww, 
= => Fe o) (C-6) 
with: 
. 2 
y — | sin (0,,t/2) i 
F(t, co p) [eee 73 (C-7) 


In order to study the physical content of equations (C-5) and (C-6), we shall 
first consider the case in which | ø; > and | ø; > are two discrete levels ($ 2), and 
then the one in which | @, > belongs to a continuum of final states ($ 3). In the 
first case, Z(t; œ) [or 2 ,(1)] really represents a transition probability which can be 
measured, while, in the second case, we are actually dealing with a probability 
density (the truly measurable quantities then involve a summation over a set of final 
states). From a physical point of view, there is a distinct difference between these 
two cases. We shall see in complements C,,,, and Dyg that, over a sufficiently 
long time interval, the system oscillates between the states | o; > and | P; > in the 
first case, while it leaves the state | y, > irreversibly in the second case. 

In §2, in order to concentrate on the resonance phenomenon, we shall choose 
a sinusoidal perturbation, but the results obtained can easily be transposed to the 
case of a constant perturbation. Inversely, we shall use this latter case for the dis- 
cussion of §3. 


2. Sinusoidal perturbation which couples two discrete states: 
the resonance phenomenon 


a. RESONANT NATURE OF THE TRANSITION PROBABILITY 


When the time ¢ is fixed, the transition probability %,,(t: œ) is a function 
only of the variable w. We shall see that this function has a maximum for: 


WO = Wy; (C-8-a) 
or: 

WO = — Wy; (C-8-b) 
A resonance phenomenon therefore occurs when the angular frequency of the 
perturbation coincides with the Bohr angular frequency associated with the pair 


of states | p, > and | py, >. If we agree to choose w > 0, relations (C-8) give the 
resonance conditions corresponding respectively to the cases w,; > 0 and wp; < 0. 
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In the first case (cf. fig. 2-a), the system goes from the lower energy level E, to the 


higher level E, by the resonant absorption of an energy quantum ho. In the second 
case (cf. fig. 2-b), the resonant.perturbation stimulates the passage of the system 


E E. 

H |r) p” 

E, E 

[ —______—— |», 
a b 

FIGURE 2 


The relative disposition of the energies E, and E, associated with the states | y, > and | o, >. If 
E; < E, (fig. a), the |g; > — > | , > transition occurs through absorption of an energy 
quantum w. If, on the other hand, E, > £, (fig. b), the lo; > — |o,» transition occurs 
through induced emission of an energy quantum hc». 


from the higher level E, to the lower level E, (accompanied by the induced emission 
of an energy quantum fw). Throughout this section, we shall assume that @,; 
is positive (the situation of figure 2-a). The case in which w,; is negative could 
be treated analogously. 

To reveal the resonant nature of the transition probability, we note that 
expressions (C-5-a) and (C-5-b) for #,(t; œ) involve the square of the modulus 
of a sum of two complex terms. The first of these terms is proportional to: 


Laer sin [(@,; + )t/2] 
A. = ———— i l op to A Af Ney o. 
il Wr +O ds (w, + @)/2 (698) 
and the second one, to: 
A_= al = — ¡elos ow sin [(w y, — @)t/2] [o i — o)t/2] (C-9-b) 
WO; — O (os — @)/2 


The denominator of the A_ term goes to zero for w = w,,, and that of the 4, 
term, for w = — wp; Consequently, for w close to œp; we expect only the 4_ 
term to be important; this is why it is called the “resonant term”, while the 4, term 
is called the “anti-resonant term” (4, would become resonant if, for negative Opio 
w were close to — W,;). 

Let us then consider the case in which: 


|v- 05 < os] (C-10) 
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neglecting the anti-resonant term A, (the validity of this approximation will be 
discussed in § c below). Taking (C-9-b) into account, we then obtain: 


Wl? 
P(t: 0) = ML reo — yj) (C-11) 
with: 
sin [(w,; — w)t/2] oy) i 
F(t, o — 05) = Oe e Sa i (C-12) 


Figure 3 represents the variation of %,(f; œ) with respect to œ, where + is fixed. 
It clearly shows the resonant nature of the transition probability. This probability 
presents a maximum for w = wp; when it is equal to |W, ¡|?+?/4h?. As we move away 
from @,,;, it decreases, going to zero for |w — ol- = 2n/t. When Jo — 05] 
continues to increase, it oscillates between the value [W,;|?/K?(w — wp} and zero 
(“diffraction pattern”). 


0 
FIGURE 3 


Variation, with respect to w, of the first-order transition probability Z, (z; œw) associated with a 
sinusoidal perturbation of angular frequency «w:;1 is fixed. When w ~ w fi» A resonance appears whose 
intensity is proportional to /? and whose width is inversely proportional to /. 


b. THE RESONANCE WIDTH AND THE TIME-ENERGY UNCERTAINTY RELATION 


The resonance width Aw can be approximately defined as the distance 
between the first two zeros of Z(t; œ) about œ = w,,. It is inside this interval 
that the transition probability takes on its largest values [the first secondary 
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maximum of .4,, attained when (w — w,,)1/2 = 3n/2, is equal to | W,,|?17/977h?, 


that is, less than 5 “o of the transition probability at resonance]. We then have: 


Aw = s (C-13) 


The larger the time +, the smaller this width. 

Result (C-13) presents a certain analogy with the time-energy uncertainty 
relation (cf. chap. IH, $D-2-e) Assume that we want to measure the energy 
difference E, — E, = hw,, by applying a sinusoidal perturbation of angular 
frequency « to the system and varying « so as to detect the resonance. If the 
perturbation acts during a time /, the uncertainty AE on the value £, — E; will be, 
according to (C-13), of the order of: 


AE =hAw =t (C-14) 


Therefore. the product tAF cannot be smaller than A. This recalls the time-energy 
uncertainty relation, although z here is not a time interval characteristic of the free 
evolution of the system, but is externally imposed. 


c. VALIDITY OF THE PERTURBATION TREATMENT 


Now let us examine the limits of validity of the calculations leading to 
result (C-11). We shall ‘irst discuss the resonant approximation, which consists of 
neglecting the anti-resonant term 4,. and then the first-order approximation in 
the perturbation expansion of the state vector. 


x. Discussion of the resonant approximation 


Using the hypothesis œ > «,;. we have neglected A , relative to A_. We shall 
therefore compare the moduli of 4, and 4_. 

The shape of the function |4 (œ)? is shown in figure 3. Since 
lA, lol = 4 -(— o)l. |A, (0)? can be obtained by plotting the curve which is 
symmetric with respect to the preceding one relative to the vertical axis o = 0. 
If these two curves, of width Aw, are centered at points whose separation is much 
larger than Ac», it is clear that. in the neighborhood of e = op; the modulus 
of A, is negligible compared to that of 4.. The resonant approximation is 
therefore justified on the condition* that: 


Ao, > Aer (C-15) 
that is, using (C-13): 


a (C-16) 


t> 
lo pl () 


Result (C-11) is therefore valid only if the sinusoidal perturbation acts during 
a time £ which is large compared to Ic. The physical meaning of such a condition 


* Note that if condition (C-15) is not satisfied, the resonant and anti-resonant terms interfere: 
2 


it is not correct to simply add |4, |? and [a |? 
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is clear: during the interval [O, 1], the perturbation must perform numerous 
oscillations to appear to the system as a sinusoidal perturbation. If, on the other 
hand, ¢ were small compared to I/w, the perturbation would not have the time 
to oscillate and would be equivalent to a perturbation varying linearly in time 
[in the case (C-1-a)] or constant [in the case (C-1-b) ]. 


COMMENT: 


For a constant perturbation, condition (C-16) can never be satisfied, 
since œ is zero. However, it is not difficult to adapt the calculations of $ b 
above to this case. We have already obtained [in (C-6)] the transition 
probability Z,(¢) for a constant perturbation by directly setting œ = 0 in 
(C-5-b). Note that the two terms A, and A_ are then equal, which shows 
that if (C-16) is not satisfied, the anti-resonant term is not negligible. 

The variation of the probability 2,(t) with respect to the energy 
difference hw,, (with the time ż fixed) is shown in figure 4. This probability 
is maximal when œ, = 0, which corresponds to what we found in $b above: 
if its angular frequency is zero, the perturbation is resonant when wp; = 0 
(degenerate levels). More generally, the considerations of § b concerning the 
features of the resonance can be transposed to this case. 


FIGURE 4 


Variation of the transition probability 2,,(1) asso- 
ciated with a constant perturbation with respect to 
wri = (E, — E,)/h, for fixed 1. A resonance appears, 
centered about (w,, = 0 (conservation of energy), 
with the same width as the resonance of figure 3, but 
an intensity four times greater (because of the cons- 
tructive interference of the resonant and anti-resonant 
0 Wp; terms, which, for a constant perturbation, are equal). 


B. Limits of the first-order calculation 


We have already noted (cf. comment at the end of §B-3-b) that the first-order 
approximation can cease to be valid when the time £ becomes too large. This 
can indeed be seen from expression (C-11), which, at resonance, can be written: 


Pirin Sy Lido (C-17) 
if > fi 4h? 
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This function becomes infinite when £ —» œ, which is absurd, since a probability 
can never be greater than 1. 

In practice, for the first-order approximation to be valid at resonance, the 
probability in (C-17) must be much smaller than 1, that is*: 


h 
t<— C-18 
|W, a 


To show precisely why this inequality is related to the validity of the first-order approximation, 
it would be necessary to calculate the higher-order corrections from (B-14) and to examine under 
what conditions they are negligible. We would then see that although inequality (C-18) is 
necessary, it is not rigorously sufficient. For example, in the terms of second or higher order, 
there appear matrix elements Wi, of W other than Wi, on which certain conditions must be 
imposed for the corresponding corrections to be small. 


Note that the problem of calculating the transition probability when ¢ does 
not satisfy (C-18) is taken up in complement Cy% in which an approximation 
of a different type is used (the secular approximation). 


3. Coupling with the states of the continuous spectrum 


If the energy £, belongs to a continuous part of the spectrum of Ho, that 
is, if the final states are labeled by continuous indices, we cannot measure the 
probability of finding the system in a well-defined state | e,> at time f. The 
postulates of chapter III indicate that in this case the quantity |< p, | W(t) >|? which 
we found above (approximately) is a probability density. The physical predictions 
for a given measurement then involve an integration of this probability density over 
a certain group of final states (which depends on the measurement to be made). 
We shall consider what happens to the results of the preceding sections in this 
case. 


a. INTEGRATION OVER A CONTINUUM OF FINAL STATES; DENSITY OF STATES 


a. Example 


To understand how this integration is performed over the final states, we 
shall first consider a concrete example. 

We shall discuss the problem of the scattering of a spinless particle of mass m 
by a potential W(r) (cf. chap. VIII). The state | y(t) > of the particle at time ¢ can 


* For this theory to be meaningful, it is obviously necessary for conditions (C-16) and (C-18) to 
be compatible. That is, we must have: 
1 h 
E < ——— 
[oy] > [Wal 


This inequality means that the energy difference |E, — E| = h|w,,| is much larger than the matrix 
element of W(t) between | p, > and | p, >. 
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be expanded on the states | p > of well-defined momenta p and energies : 


2 
pe. (C-19) 
2m 


The corresponding wave functions are the plane waves: 


/ 1 3/2 
— ip- rh ES 
crip> = (53) e (C-20) 


The probability density associated with a measurement of the momentum is 
IX p | H(t) >|? [| w(1) > is assumed to be normalized]. 

The detector used in the experiment (see, for example, figure 2 of chapter VHI) 
gives a signal when the particle is scattered with the momentum p,. Of course, 
this detector always has a finite angular aperture, and its energy selectivity is not 
perfect : it emits a signal whenever the momentum p of the particle points within 
a solid angle 62, about p, and its energy is included in the interval OE, centered 
at E, = p;/2m. If D, denotes the domain of p-space defined by these conditions, 
the probability of obtaining a signal from the detector is therefore: 


¿Pp t) = | pl MOR (C-21) 


To use the results of the preceding sections, we shall have to perform a change of 
variables which results in an integral over the energies. This does not present any 
difficulties, since we can write : 


Pp = p dpdQ (C-22) 


and replace the variable p by the energy E, to which it is related by (C-19). We thus 
obtain: 


Pp = p(E)dEdQ (C-23) 


where the function p(£), called the density of final states, can be written, according 
to (C-19), (C-22) and (C-23): 


dp m a 
O IE A 
AE) = Pq = PO my 2mE (C-24) 
(C-21) then becomes: 
6Ap,.1) = | dO dE p(E) |< p | v(t) >? (C-25) 
E62 
EeoE, 


B. The general case 


Assume that, in a particular problem, certain eigenstates of H, are labeled 
by a continuous set of indices, symbolized by a, such that the orthonormalization 
relation can be written: 


<a |a > = dla — a) (C-26) 
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The system is described at time ¢ by the normalized ket | W(t) >. We want to calculate 
the probability 6P(x,. t) of finding the system, in a measurement, in a given group 
of final states. We characterize this group of states by a domain D, of values of the 
parameters a, centered at «,, and we assume that their energies form a continuum. 
The postulates of quantum mechanics then yield: 


dPla,, t) = | dx |< a | w(t) >|? (C-27) 
zeDy 


As in the example of §a above, we shall change variables, and introduce the 
density of final states. Instead of characterizing these states by the parameters «, 
we shall use the energy E and a set of other parameters f (which are necessary 
when Ho alone does not constitute a C.S.C.O.). We can then express da in terms 
of dE and dp: 


dx = p(B. Edf dE (C-28) 


in which the density of final states p(B, E)* appears. If we denote by 0f, and JE, 
the range of values of the parameters fp and E defined by Dp, we obtain: 


szan = | dp dE p(B, E) |< B, E | we) X|? (C-29) 
Bedpy 

EeðE; 
where the notation | « > has been replaced by | $, E > in order to point up the £- and 
f-dependence of the probability density |< x | y (£) >|?. 


b. FERMIS GOLDEN RULE 


In expression (C-29), | w(+) > is the normalized state vector of the system 
at time f. As in $A of this chapter, we shall consider a system which is initially 
in an eigenstate | @; > of Ho [| p;> therefore belongs to the discrete spectrum 
of Ho. since the initial state of the system must, like |w(1) >. be normalizable |. 
In (C-29). we shall replace the notation 8P (ap, t) by 92 (9;, xp. 1) in order to 
remember that the system starts from the state ie. ` 

The calculations of $B and their application to the case of a sinusoidal or 
constant perturbation ($$C-1 and C-2) remain valid when the final state of the 
system belongs to the continuous spectrum of H,. If we assume W to be constant, 
we can therefore use (C-6) to find the probability density |< f, E | w(r) >|? to first 
order in W. We then get: 


2 als > 2 E E; a 
A oo (E ) (C-30) 


* In the general case, the density of states p depends on both E and 8. However, it often happens 
(cf. example of §& above) that p depends only on E. 
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where E and E, are the energies of the states | £, E > and | o; > respectively, and F 
is the function defined by (C-7). We get for ôP (p;, ap, t), finally: 


E — E; 


dB AE AB, EK B. E| W | o> P (157E) 31) 


The function F| t, a ) varies rapidly about E = E, (cf. fig. 4). If £ is sufficiently 


h 
large, this function can be approximated, to within a constant factor, by the 
function 6(E — E;), since, according to (11) and (20) of appendix II, we have: 


Lim F(« 2) = m0( 55) = 2nht (E — E) (C-32) 


t> æ 


On the other hand, the function p(B, E) |< B, E| W | @; >|? generally varies much 
more slowly with E. We shall assume here that ¢ is sufficiently large for the variation 
of this function over an energy interval of width 4zh/t centered at E = E, to be 


negligible*. We can then in (C-31) replace F{ t, A a by its limit (C-32). This 
enables us to perform the integral over E immediately. If, in addition, 68, is very 


small, integration over ß is unnecessary, and we finally get: 


— when the energy E, belongs to the domain 6E,: 
2n 
OP(Q;, f, t) = öbs tl (By, E, = E, | W| P;) |? p(B,, E, = Ej) (C-33-a) 


— When the energy E, does not belong to this domain : 
ÖP (Pp; Ast) = 0 (C-33-b) 


As we saw in the comment of §C-2-c-«, a constant perturbation can induce 
transitions only between states of equal energies. The system must have the same 
energy (to within 27h/t) in the initial and final states. This is why, if the domain JE, 
excludes the energy £;, the transition probability is zero. 

The probability (C-33-a) increases linearly with time. Consequently, the 
transition probability per unit time, 9W'(P;. %,), defined by: 


is d 
OW (Pi Xp) = dt PQ; Ap, t) (C-34) 


is time-independent. We introduce the transition probability density per unit time and 
per unit interval of the variable 6 ,: 


w(p, ap) = Mgaa (C-35) 


* p(B, E)K B, E| W | @, >|? must vary slowly enough to enable the finding of values of t 
which satisfy the stated condition but remain small enough for the perturbation treatment of W to be 
valid. Here, we also assume that ôE, > 4ah/t. 
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It is equal to: 


| o 7 
| 2n 
| w(P;, ay) =h E Br E, = E, | W | 0; >|? PAB ys E, = E) (C-36) 


This important result is known as Fermi's golden rule. 


COMMENTS: 


Assume that W is a sinusoidal perturbation of the form (C-1-a) or (C-1-b), 
which couples a state | p; > to a continuum of states | Py, E, > with energies E, 
close to E, + hw. Starting with (C-11), we can carry out the same procedure 
as above, which yields: 


n 
WP; a) = mI Bp Es = E, +ho| W| o:>) plp Ef = E, +ho) (C-37) 


Let us return to the problem of the scattering of a particle by a potential W whose matrix 
elements in the { |r > } representation are given by: 


<r| wW 


r >= Wir) d(r — r’) (C-38) 
Now assume that the initial state of the system is a well-defined momentum state: 

|y = 0)> = |p;> (C-39) 
and we shall calculate the scattering probability of an incident particle of momentum p, 
into the states of momentum p grouped about a given value p, (with |p,| = [p,|). 


(C-36) gives the scattering probability w(p,, py) per unit time and per unit solid angle 
about p = p,;: 


2 
mp. Py) = F K ps [W | pe PP OLE, = E) (C-40) 


Taking into account (C-20), (C-38) and expression (C-24) for p(£), we then get: 


2 


| dir eles Ps Ih W(r) (C-41) 


2n — (1 NV 
/ 
w(P; Py) = PA 2mE; (5) 


On the right-hand side of this relation, we recognize the Fourier transform of the 
potential W(r), evaluated for the value of p equal to p; — py. 

Note that the initial state |P;> chosen here is not normalizable, and it cannot 
represent the physical state of a particle. However, although the norm of | p, > is infinite, 
the right-hand side of (C-41) maintains a finite value. Intuitively, we can therefore expect 
to obtain a correct physical result from this relation. If we divide the probability obtained 
by the probability current: 


3 3 | 
1 \ hk, 1 IDE, 

==.) ~= = [— 127 (C-42) 
2nh} m 2h) N m 
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associated, according to (C-20), with the state | p, >. we obtain: 


2 


2 
w(p; Py) A [er eii- psih Wp) (C-43) 
J; 4n*h* 


t 


which is the expression for the scattering cross section in the Born approximation ($B-4 


of chap. VIII). 
Although it is not rigorous, the preceding treatment enables us to show that the 


scattering cross sections of the Born approximation can also be obtained by a time- 
dependent approach, using Fermi’s golden rule. 


References and suggestions for further reading: 


Perturbation series expansion of the evolution operator: Messiah (1.17), 
chap. XVII, $51 and 2. 

Sudden or adiabatic modification of the Hamiltonian: Messiah (1.17), chap. XVII, 
$ IL; Schiff (1.18), chap. 8. $35. 

Diagramatic representation of a perturbation series (Feynman diagrams) : 
Ziman (2.26), chap. 3: Mandl (2.9), chaps. 12 to 14; Bjorken and Drell (2.10), chaps. 16 
and 17. 
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COMPLEMENTS OF CHAPTER XIII 


Axm: 


Bym: 


Cx: 


Dxm: 


Exam! 


INTERACTION OF AN ATOM 
WITH AN ELECTROMAGNETIC WAVE 


LINEAR AND NON-LINEAR RESPONSE 
OF A TWO-LEVEL SYSTEM SUBJECTED 
TO A SINUSOIDAL PERTURBATION 


OSCILLATIONS OF A SYSTEM 
BETWEEN TWO DISCRETE STATES 
UNDER THE EFFECT 

OF A RESONANT PERTURBATION 


DECAY OF A DISCRETE STATE 
RESONANTLY COUPLED TO A CONTINUUM 
OF FINAL STATES 


EXERCISES 


Axı: illustration of the general considerations of 
$C-2 of chapter XIII, using the important example 
of an atom interacting with a sinusoidal electro- 
magnetic wave. Introduces fundamental concepts 
such as: spectral line selection rules, absorption 
and induced emission of radiation, oscillator 
strength... Although moderately difficult, can be 
recommended for a first reading because of the 
importance of the concepts introduced. 


Bi: a simple model for the study of some 
non-linear effects which appear in the interaction 
of an electromagnetic wave with an atomic 
system (saturation effects, multiple-quanta tran- 
sitions,...). More difficult than Axm (graduate 
level); should therefore be reserved for a sub- 
sequent study. 


Cx Study of the behavior, over a long time 
interval. of a system which has discrete energy 
levels, subjected to a resonant perturbation. 
Completes, in greater detail, the results of $C-2 
of chapter XII], which are valid only for short 
times. Relatively simple. 


Dx: Study of the behavior, over a long time 
interval, of a discrete state resonantly coupled 
to a continuum of final states. Completes, in 
greater detail, the results obtained in $C-3 of 
chapter XII (Fermi's golden rule), which were 
established only for short time intervals. Proves 
that the probability of finding the particle in the 
discrete level decreases exponentially and justifies 
the concept of lifetimes introduced phenomeno- 
logically in complement Kı Important for its 
numerous physical applications; graduate level. 


Ex: exercise 10 can be done at the end of comple- 
ment A xp; it is a step-by-step study of the effect of 
the external degrees of freedom of a quantum 
mechanical system on the frequencies of the elec- 
tromagnetic radiation it can absorb (the Doppler 
effect, recoil energy, the Móssbauer effect). 

Certain exercises (especially 8 and 9) are more 
difficult than those of other complements, but 
treat important physical phenomena. 


ee e A E 5 o o o 5 5 5 o 5 5 
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Complement Axm 


THE INTERACTION OF AN ATOM 
WITH AN ELECTROMAGNETIC WAVE 


1. The interaction Hamiltonian. Selection rules 


a. Fields and potentials associated with a plane electromagnetic wave 
b. The interaction Hamiltonian at the low-intensity limit 

c. The electric dipole Hamiltonian 

d. The magnetic dipole and electric quadrupole Hamiltonians 


2. Non-resonant excitation. Comparison with the elastically bound electron model 


a. Classical model of the elastically bound electron 
b. Quantum mechanical calculation of the induced dipole moment 
c. Discussion. Oscillator strength . 


3. Resonant excitation. Induced absorption and emission 


a. Transition probability associated with a monochromatic wave 
b. Broad-line excitation. Transition probability per unit time 


In §C of chapter XIII, we studied the special case of a sinusoidally time-dependent 
perturbation : W(t) = W sin wt. We encountered the resonance phenomenon 
which occurs when w is close to one of the Bohr angular frequencies 
wp; = (E, — E,)/h of the physical system under consideration. 

A particularly important application of this theory is the treatment of an atom 
interacting with a monochromatic wave. In this complement, we will use this 
example to illustrate the general considerations of chapter XIII and to point out 
certain fundamental concepts of atomic physics such as spectral line selection rules, 
induced absorption and emission of radiation, oscillator strength, etc... 

As in chapter XIII, we shall confine ourselves to first-order perturbation 
calculations. Some higher-order effects in the interaction of an atom with an 
electromagnetic wave (“non-linear” effects) will be taken up in complement B,,,. 

We shall begin (§1) by analyzing the structure of the interaction Hamiltonian 
between an atom and the electromagnetic field. This will permit us to isolate the 
electric dipole, magnetic dipole and electric quadrupole terms, and to study the 
corresponding selection rules. Then we shall calculate the electric dipole moment 
induced by a non-resonant incident wave (§2) and compare the results obtained 
with those of the model of the elastically bound electron. Finally, we shall 
study (§3) the processes of induced absorption and emission of radiation which 
appear in the resonant excitation of an atom. 
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1. The interaction Hamiltonian. Selection rules 


a. FIELDS AND POTENTIALS ASSOCIATED 
WITH A PLANE ELECTROMAGNETIC WAVE 


Consider a plane electromagnetic wave*, of wave vector k (parallel to Oy) and 
angular frequency œw = ck. The electric field of the wave is parallel to Oz and the 
magnetic field, to Ox (fig. 1). 


FIGURE 1 


The electric field E and magnetic field B of a plane wave 
of wave vector k. 


For such a wave, it is always possible, with a suitable choice of gauge 
(cf. appendix III, §4-b-«), to make the scalar potential U(r, t) zero. The vector 
potential A(r, t) is then given by the real expression: 


Alr, t) = Zoe, eiD 4 ft e 0 HO) (1) 


where , is a complex constant whose argument depends on the choice of the 
time origin. We then have: 


E(r, t) = — Żak, t) = iws g e, e972 — ind e, e Yon (2) 
B(r, 1) =V x A(r, t) = ike, e, e 097%) — ik$ e, e ben (3) 


We shall choose the time origin such that the constant £o is pure imaginary, and 
we set: 


$ (4-a) 
TET EM 5 (4-b) 


* For the sake of simplicity, we shall confine ourselves here to the case of a plane wave. The 
results obtained in this complement, however, can be generalized to an arbitrary electromagnetic field. 
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where & and 4 are two real quantities such that: 


We then obtain: 


E(r, t) = & e_ cos (kv — wt) (6) 
Bir, t) =% e, cos (Ay — wt) (7) 


é and 4 are therefore the amplitudes of the electric and magnetic fields of the plane 
wave considered. 

Finally, we shall calculate the Poynting vector* G associated with this plane 
wave: 


G = 6c? Ex B (8) 


Replacing E and B in (8) by their expressions (6) and (7), and taking the time- 
average value over a large number of periods, we obtain, using (5): 


= E? 
G= bol ey (9) 


b. THE INTERACTION HAMILTONIAN AT THE LOW-INTENSITY LIMIT 


The preceding wave interacts with an atomic electron (of mass m and 
charge q) situated at a distance r from O and bound to this point O by a central 
potential V(r) (created by a nucleus assumed to be motionless at O). The quantum 
mechanical Hamiltonian of this electron can be written: 


H =>, [P — ¿A(R, 1)]? + V(R) — £S . BR, 1) (10) 


The last term of (10) represents the interaction of the spin magnetic moment of the 
electron with the oscillating magnetic field of the plane wave. A(R, ż) and B(R, 1) 
are the operators obtained by replacing, in the classical expressions (1) and (3), 
x, y, z by the observables X, Y, Z. 

In expanding the square which appears on the right-hand side of (10), we 
should, in theory, remember that P does not generally commute with a function of R. 
Such a precaution is, however, unnecessary in the present case, since, as A is 
parallel to Oz [formula (1)], only the P, component enters into the double product; 
now P. commutes with the Y component of R, which is the only one to appear in 
expression (1) for A(R, 1). We can then take: 


H = Ho + W(t) (11) 


* Recall that the energy flux across a surface element dS perpendicular to the unit vector a 
is G.ndS. 
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where: 
Ho. =~ + V(R) (12) 
is the atomic Hamiltonian, and: 


_ _4 q q A 

W(t) = — p P A(R, t) — 7,8 - BIR, t) +57 TA(R, t)] (13) 
is the interaction Hamiltonian with the incident plane wave [the matrix elements 
of W(t) approach zero when ., approaches zero]. 

The first two terms of the right-hand side of (13) depend linearly on o, and 
the third one depends on it quadratically. With ordinary light sources, the intensity 
is sufficiently low that the effect of the .4% term can be neglected compared to that 
of the £a term. We shall therefore write: 


W(t) = W(t) + Wilt) (14) 
with: 

Wit) = - <P. A(R, t) (15) 

Wilt) = — 2S. BR, t) (16) 


We shall evaluate the relative orders of magnitude of the matrix elements 
of W, (1) and W, (t) between two bound states of the electron. Those of S are of the 
order of h, and B is of the order of k.%, [cf. formula (3)]. Thus: 


q 

— hka 
Walt) Po m : = hk (17) 
W(t) q p 

m PA o 


According to the uncertainty relations, h/p is, at most, of the order of atomic 
dimensions (characterized by the Bohr radius, a, = .5 A). k is equal to 2z/A, 
where A is the wavelength associated with the incident wave. In the spectral domains 
used in atomic physics (the optical or Hertzian domains), 4 is much greater than ap, 
so that: 


Wilt) _ ao 
~— <i 18 
wi) ~ $ (18) 
c. THE ELECTRIC DIPOLE HAMILTONIAN 
x. The electric dipole approximation. Interpretation 


Using expression (1) for A(R, 1), we can put W (r) in the form: 


A E + ot eT e] (19) 
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Expanding the exponential e+'** 


in powers of KY: 

et*¥ — 14 ikY eS +... (20) 
Since Y is of the order of atomic dimensions, we have, as above: 

ky ~ 21 (21) 


We therefore obtain a good approximation for W, by retaining only the first term 
of expansion (20). Let W; be the operator obtained by replacing e+” by 1 on the 
right-hand side of (19). Using (4-a), we get: 


Worlt) = ze P, sin œt (22) 


Wpe(t)is called the “electric dipole Hamiltonian”. The electric dipole approximation, 
which is based on conditions (18) and (21), therefore consists of neglecting W,,(t) 
relative to W,(t) and identifying W;(t) with W,,,(¢): 


W(t) = Woglt) (23) 


Let us show that, if we replace W(t) by W,,(t), the electron oscillates as 
if it were subjected to a uniform sinusoidal electric field $ e, cos wt, whose amplitude 
is that of the electric field of the incident plane wave evaluated at the point O. 
Physically, this means that the wave function of the bound electron is too localized 
about O for the electron to “feel” the spatial variation of the electric field of the 
incident plane wave. We shall therefore calculate the evolution of < R >(t). 
Ehrenfest's theorem (cf. chap. III, $ D-1-d) gives: 


á oe PD 
q; < R> = 7% <LR, Ho + Woa)] > = = + e Sin wt os 
d 1 
= <P> =-=<[P,Ho + Wog] > = — < VV(R) > 
dt ih 
Eliminating < P > from these two equations, we obtain, after a simple calculation: 
2 
mo (RY = ~ < VV(R) > + qê e, cos wt (25) 
t 


which is indeed the predicted result: the center of the wave packet associated with 
the electron moves like a particle of mass m and charge g which is subject to both 
the central force of the atomic bond [the first term on the right-hand side of (25)] 
and the influence of a uniform electric field [the second term of (25)]. 


COMMENT: 


Expression (22) for the electric dipole interaction Hamiltonian seems rather 
unusual for a particle of charge q interacting with a uniform electric field E = £ e, cos wt. 
We would tend to write the interaction Hamiltonian in the form: 


Wot) = — D.E = — q&Z cos wt (26) 


where D = gR is the electric dipole moment associated with the electron. 
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Actually, expressions (22) and (26) are equivalent. We shall show that we can go 
from one to the other by a gauge transformation (which does not modify the physical 
content of the quantum mechanics; cf. complement H,,;). The gauge used to obtain (22) 
is: 


A(r, t) = Le, sin (ky — wt) (27-a) 
Ur, 1) = 0 (27-b) 


[to write (27-a), we have replaced «J, by &/2iw in (1); cf. formula (4-a)]. Now consider the 
gauge transformation associated with the function: 


é 
ar, t) = 2 sin wt (28) 


Thus, we introduce a new gauge { A’, U’ | characterized by: 


E ; 
A' = A + Vy = e,— [sin (ky — wt) + sin wt] (29a) 
w 
ox 
U' = U — =) = — 26 cos wt (29b) 


The electric dipole approximation amounts to replacing ky by 0 everywhere. We 
then see that in this approximation: 


y 


6 
A’ = e,— [sin (— wt) + sin wt] = 0 (30) 
w 


If, in addition, we neglect, as we did above, the magnetic interaction terms related to the 
spin, we obtain, for the system’s Hamiltonian : 


1 
H' = mE — qa Y? + V(R) +qU'(R, t) 


p? 
= — +V(R) + qUí(R, £ 
= + VIR) + qU'R, f) 
= Ho + W'(t) (31) 
where H, is the atomic Hamiltonian given by (12), and: 
W'(t) = qU(R, 1) = — qZ € cos wt = Wolt) (32) 
is the usual form (26) of the electric dipole interaction Hamiltonian. 

Recall that the state of the system is no longer described by the same ket when 
we go from gauge (27) to gauge (29) (cf. complement Hu). The replacement of Wpglt) 
by Wielt) must therefore be accompanied by a change of state vector, the physical 
content, of course, remaining the same. 


In the rest of this complement, we shall continue to use gauge (27). 
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b. The matrix elements of the electric dipole Hamiltonian 


Later, we shall need the expressions for the matrix elements of W,, 
between | p,> and | p f >» eigenstates of Ho of eigenvalues E, and E,. According 
to (22), these matrix elements can be written: 


< 9, | Wolt) pi> = SS sin ot <p, | P.o: (33) 


It is simple to replace the matrix elements of P, by those of Z on the right- 
hand side of (33). Insofar as we are neglecting all magnetic effects in the atomic 
Hamiltonian [cf. expression (12) for Ho], we can write: 


4 CH 
[Z, Ho] = iñ AP. 


= ih (34) 
which yields: 
< ps |[Z, Ho]|0:> =<p,1ZH, — HoZ| 9; > 
= — (E, — E)< 9,|Z|9> Eo. 00 (35) 
Introducing the Bohr angular frequency w,; = (E, — E;)/h.we then get : 


Cpr |P lpi) = imwon o|o? (36) 


and, consequently : 
Orp: 
< Pr | Wolt) | pi> = ig" 8 sinot< os| Z| 9% > (37) 
The matrix elements of Wpg(t) are therefore proportional to those of Z. 


COMMENT: 


It is the matrix element of Z which appears in (37) because we have 
chosen the electric field E(r, £) parallel to Oz. In practice, we may be led to 
choose a frame Oxyz which is related, not to the light polarization, but to the 
symmetry of the states | g; > and | p, >. For example, if the atoms are placed 
in a uniform magnetic field B,, the most convenient quantization axis for the 
study of their stationary states | p, > is obviously parallel to By. The pola- 
rization of the electric field E(r, z) can then be arbitrary relative to Oz. In 
this case, we must replace the matrix element of Z in (37) by that of a linear 
combination of X, Y and Z. 


y. Electric dipole transition selection rules 


If the matrix element of Wpg between the states | p; > and | p, > is different 
from zero, that is, if < p,|Z | p, > is non-zero*, the transition | p, > —> | py > 
is said to be an electric dipole transition. To study the transitions induced 


* Actually, it suffices for one of the three numbers < p,|[Z |p,>,<p¿|X|p,>or<o,| Y |o: 
to be different from zero (cf. comment of $ B above). 
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between | o; > and | p, > by the incident wave, we can then replace W(t) by Wpelt). 
If, on the other hand, the matrix element of W,,(t) between | p; > and | p,> is 
zero, we must pursue the expansion of W(t) further, and the corresponding 
transition is either a magnetic dipole transition or an electric quadrupole transi- 
tion, etc... * (see following sections). Since W,,(t) is much larger than the 
subsequent terms of the power series expansion of W(t) in a,//, electric dipole 
transitions will be, by far, the most intense. In fact, most optical lines emitted by 
atoms correspond to electric dipole transitions. 


Let: 
PnitamlE) = Ro, (1) Yr O. o) (38) 
Pnptym (T) = Rpa 10) YT (0, o) 


be the wave functions associated with | p, > and | p, >. Since: 
An vo 
z = rcos0 = 3" Y (0) (39) 


the matrix element of Z between | y, > and | p, > is proportional to the angular 
integral: 


| dQ yr *(0, p) Y 90) ¥™(0, p) (40) 


According to the results of complement C,, this integral is different from zero 
only if: 

lp=1,+1 (41) 
and: 


(42) 


Actually, it would suffice to choose another polarization of the electric field (for 
example, parallel to Ox or Oy; see comment of 8f) to have: 


mp=m,+1 (43) 


From (41), (42) and (43), we obtain the electric dipole transition selection rules: 


| Ai=i,—-t= +1 (44-a) 


| Am=m,—m,= —1,0,+ 1 (44-b) 


f 


* It may happen that all the terms of the expansion have zero matrix elements. The transition 
is then said to be forbidden to all orders (it can be shown that this is always the case if | o; > and | o; > 
both have zero angular momenta). 
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COMMENTS: 


(i) Z is an odd operator. It can connect only two states of different parities. 
Since the parities of | p, > and | p, > are those of /; and /,, Al = l, — l; must 
be odd, as is compatible with (44-a). 


(ii) If there exists a spin-orbit coupling ¿(rJL . S between L and S (cf. chap. XII, $ B-1-b-B), 
the stationary states of the electron are labeled by the quantum numbers /, s, J, m, (with 
J = L + S). The electric dipole transition selection rules can be obtained by looking for 
the non-zero matrix elements of R in the { | Z, s, J, m; > } basis. By using the expansions 
of these basis vectors on the kets |/, m > |s, ms > (cf. complement Ax, $ 2), we find, 
starting with (44-a) and (44-b), the selection rules: 


AJ = 0, + 1 (44-c) 
A =+1 (44-d) 
Am, =0,+1 (44-e) 


Note that a AJ = 0 transition is not forbidden [unless J, = J, = 0; cf. note on the 
preceding page]. This is due to the fact that J is not related to the parity of the level. 

Finally, we point out that selection rules (44-c, d, e) can be generalized to many- 
electron atoms. 


d. THE MAGNETIC DIPOLE AND ELECTRIC QUADRUPOLE HAMILTONIANS 


a. Higher-order terms in the interaction Hamiltonian 

The interaction Hamiltonian given by (14) can be written in the form: 

W(t) = Wilt) + Wilt) = Welt) + CAC) — Welt) + Wilt) (45) 
Thus far, we have studied W,,(t). As we have seen, the ratio of W(t) — Wpelt) 
and W,,(t) to W,,(¢) is of the order of a/A. 

= To calculate W,(t) — Wiglt), we simply replace e*" by 

et *Y _ ] ~ + ikY +... in (19), which yields: 

W(t) — Walt) = — 4 [ik A y e7 — klk "| PLY +... (46) 
or, using (4-b): 


W(t) — Welt) = — ig cos wtP,Y +.. (47) 


If we write P, Y in the form: 


1 1 1 1 
PY =5(P,Y — ZP) +5(P,Y + ZP,)=5L, +>(P,Y +ZP,) (48) 
we obtain, finally : 
q q 
Wit) = Worlt) = — Fm L, 2 COS wt — Fm B cos atl YP, + ZP,] +. (49) 
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In the expression for W,,(t) [ formulas (16) and (3)], it is entirely justified 


to replace e+" by 1. We thus obtain a term of order a,/A relative to W,(t), that is, 
of the same order of magnitude as W, (1) — Wpelt): 


W(t) = — 4 S, B cos ot +... (50) 


Substituting (49) and (50) into (45) and grouping the terms differently, we 
obtain: 


W(t) = Woe(t) + Woult) + Welt) + -.. (51) 
with: 
Wm == A (L, +28,) 8 cos wt (52) 
2m 
Wor = — = (YP, + ZP,) & cos at (53) 


[we have replaced 2 by £/c in (53)]. Woy and Mos (which are, a priori, of the same 
order of magnitude) are, respectively, the magnetic dipole and electric quadrupole 
Hamiltonians. 


PB. Magnetic dipole transitions 


The transitions induced by Wpm are called magnetic dipole transitions. 
Wu represents the interaction of the total magnetic moment of the electron with 
the oscillating magnetic field of the incident wave. 

The magnetic dipole transition selection rules can be obtained by considering 
the conditions which must be met by | g,; > and | p, > in order for Woy to have 
a non-zero matrix element between these two states. Since neither L, nor S, changes 
the quantum number /, we must have, first of all, 4/= 0. L, changes the 
eigenvalue m, of L, by + 1, which gives Am, = + 1. S, changes the eigenvalues ms 
of S, by + l, so that Ams = + 1. Note, furthermore, that if the magnetic field 
of the incident wave were parallel to Oz, we would have 4m, = 0 and Am, = 0. 
Grouping these results, we obtain the magnetic dipole transition selection rules: 


Al = 0 
Am, = + 1,0 (54) 
Am, = + 1,0 


COMMENT: 


In the presence of a spin-orbit coupling, the eigenstates of H, are labeled by the 
quantum numbers / and J. Since L, and S, do hot commute with J?, W,,, can connect 
states with the same / but different J. By using the addition formulas for an angular 
momentum / and an angular momentum 1/2 (cf. complement Az, §2), it can easily be 
shown that selection rules (54) become: 


Al=0 
AJ = + 1,0 (55) 
Am, = + 1,0 
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Note that the hyperfine transition F = 0 <=> F = t of the ground state of the 
hydrogen atom (cf. chap. XII, $ D) is a magnetic dipole transition, since the components 
of S have non-zero matrix elements between the states of the F = 1 level and the 
| F = 0, mp = 0 > state. 


y. Electric quadrupole transitions 


Using (34), we can write: 


YP, + ZP, = YP, + P,Z =- { Y[Z, Ho] + [Y, HolZ} 


(56) 
= 5 (YZH, — H,YZ) 
from which we obtain, as in (36): 
< py | Woelt)| > => wn< pr] YZ |o: > 6 cos cut (57) 


The matrix element of Wo, (1) is therefore proportional to that of YZ, which 
is a component of the electric quadrupole moment of the atom (cf. complement Ex). 
In addition, the following quantity appears in (57): 


Ari Pg OE es qlke (58) 
C wm C 109) 


which, according to (2), is of the order of q08,/0y. Wo,(t) can therefore be interpreted 
as the interaction of the electric quadrupole moment of the atom with the gradient* 
of the electric field of the plane wave. 

To obtain the electric quadrupole transition selection rules, we simply note 
that, in the { | r > } representation, YZ is a linear superposition of r?Y3(0. p) 
and r?Y7*'(0, y). Therefore, in the matrix element < p, | YZ | q, > there appear 
angular integrals : 


| dQ Yr (0, o) YF "0, OP(O, p) (59) 


which, according to the results of complement Cy. are different from zero only 
if Al = 0, + 2 and dm = + 1. This last relation becomes Am = + 2, + 1,0 when 
we consider an arbitrary polarization of the incident wave (cf. comment of $1-c-B), 
and the electric quadrupole transition selection rules can be written, finally: 


Al=0, +2 
nE (60) 
Am=0,+1,+2 


* It is normal for the electric field gradient to appear, since Mog(t) was obtained by expanding 
the potentials in a Taylor series in the neighborhood of O. 
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COMMENTS: 


(i) Woy and Wo, are even operators and can therefore connect only states of the 
same parity. which is compatible with (54) and (60). For a given transition, 
Woy and Wo, are never in competition with W,,. This facilitates the 
observation of magnetic dipole and electric quadrupole transitions. 
Most of the transitions which occur in the microwave or radio- 
frequency domain — in particular, magnetic resonance transitions (cf. comple- 
ment F,,) — are magnetic dipole transitions. 


(ii) For a A/=0, Am = 0, + I transition, the two operators W,, and Wo, 
simultaneously have non-zero matrix elements. However, it is possible to find 
experimental conditions under which only magnetic dipole transitions are 
induced. All we need to do is place the atom, not in the path of a plane wave, 
but inside a cavity or radiofrequency loops, at a point where B is large but 
the gradient of E is negligible. 


(iii) Fora Al = 2 transition, W,,, cannot be in competition with Woy, and we have 
a pure quadrupole transition. As an example of a quadrupole transition, 
we can mention the green line of atomic oxygen (5 577 A ), which appears 
in the aurora borealis spectrum. 


(iv) If we pursued the expansion of e*” further, we would find electric octupole 
and magnetic quadrupole terms, etc. 


In the rest of this complement, we shall confine ourselves to electric dipole 
transitions. In the next complement, Byin, on the other hand, we shall consider 
a magnetic dipole transition. 


2. Non-resonant excitation. 
Comparison with the elastically bound electron model 


In this section, we shall assume that the atom, initially in the ground 
state | p, >, is excited by a non-resonant plane wave: œ coincides with none of 
the Bohr angular frequencies associated with transitions from | po >. 

Under the effect of this excitation, the atom acquires an electric dipole 
moment < D >(+) which oscillates at the angular frequency w (forced oscillation) 
and is proportional to & when @ is small (linear response). We shall use perturbation 
theory to calculate this induced dipole moment, and we shall show that the results 
obtained are very close to those found with the classical model of the elastically 
bound electron. 

This model has played a very important role in the study of the optical 
properties of materials. It enables us to calculate the polarization induced by the 
incident wave in a material. This polarization, which depends linearly on the 
field &, behaves like a source term in Maxwell’s equations. When we solve these 
equations, we find plane waves propagating in the material at a velocity different 
from c. This allows us to calculate the refractive index of the material in terms of 
various characteristics of elastically bound electrons (natural frequencies, number 
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per unit volume, etc.). Thus, we see that it is very important to compare the 
predictions of this model (which we shall review in §a) with those of quantum 
mechanics. 


a. CLASSICAL MODEL OF THE ELASTICALLY BOUND ELECTRON 


a. Equation of motion 


Consider an electron subject to a restoring force directed towards the point O 
and proportional to the displacement. In the classical Hamiltonian corresponding 
to (12), we then have: 


V(r) = 5 mos r? (61) 


where @, is the electron’s natural angular frequency. 

If we make the same approximations, using the classical interaction 
Hamiltonian, as those which enabled us to obtain expression (22) for W,,(t) (the 
electric dipole approximation) in quantum mechanics, a calculation similar to that 
of §l-c-x [ef. equation (25)] yields the equation of motion: 

2 
e + ot 2 = cos ot (62) 
dt? m 


This is the equation of a harmonic oscillator subject to a sinusoidal force. 


B. General solution 


The general solution of (62) can be written: 


z = A cos (wot — P) + > COS wt (63) 


mu — w 


where A and q are real constants which depend on the initial conditions. The 
first term of (63), A cos (wt — q), represents the general solution of the homo- 
geneous equation (the electron's free motion). The second term is a particular 
solution of the equation (forced motion of the electron). 

We have not, thus far, taken damping into account. Without going into 
detail in the calculations, we shall cite the effects of weak damping: after a certain 
time 7, it causes the natural motion to disappear and very slightly modifies the 
forced motion (provided that we are sufficiently far from resonance: |œ — wo| > 1/1). 
We shall therefore retain only the second term of (63): 


& COS wt 
m(w — w°) 
COMMENT: 
Far from resonance, the exact damping mechanism is of little importance, provided 


that it is weak. We shall not, therefore, take up the problem of the exact description of 
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this damping, either in quantum or in classical mechanics. We shall merely use the fact 
that it exists to ignore the free motion of the electron. 

It would be different for a resonant excitation: the induced dipole moment would 
then depend critically on the exact damping mechanism (spontaneous emission, thermal 
relaxation, etc.). This is why we shall not try to calculate < D >(¢) in $ 3 (the case of a reso- 
nant excitation). We shall be concerned only with calculating the transition probabilities. 

In complement Bym, we shall study a specific model of a system placed in an 
electromagnetic wave and at the same time subject to dissipative processes (Bloch 
equations of a system of spins). We shall then be able to calculate the induced dipole 
moment for any exciting frequency. 


y. Susceptibility 


Let 9 = qz be the electric dipole moment of the system. According to (64), 
we have: 


p= qz = 5 & cos wt = x $ cos wt (65) 


oe: oe (66) 


b. QUANTUM MECHANICAL CALCULATION OF THE INDUCED DIPOLE MOMENT 


We shall begin by calculating, to first order in $, the state vector | w(t) > 
of the atom at time 1. We shall choose for the interaction Hamiltonian, the electric 
dipole Hamiltonian W,, given by (22). In addition, we shall assume that: 


| w(t = 0)> = | > (67) 


We apply the results of $C-1 of chapter XIII, replacing W,; by ze On | P: |p: 
and | p; > by | Po >. Thus we obtain*: 


| W(t) > =e 2 | o> + L ABP eT E] oa> (68) 


or, using (C-4) of chapter XIII and multiplying | (rt) > by the global phase 
factor e'Eo'/A which has no physical importance: 


ul) > =| Po? + Y nina l Ol Paleo) 


e7 eno e pio! Cr eee —iwnot __ 7 tot 
— — 69 
Sl Ono +O Ono a fe (69) 


* Since Woz is odd, < @ | Wog(t) | Po > is zero, so b§(1) = 0 
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From this, we find < y(t) | and < D, X(t) = < W(t) | 9Z | y (£) >. In the calculation 
of this mean value, we retain only the terms linear in &, and we neglect all those 
which oscillate at angular frequencies + (0, (the natural motion, which would 
disappear if we took weak damping into account). Finally, after having replaced 
< Pn | P, | Po > by its expression in terms of < 9, | Z | @ > [ef equation (36)], 
we find: 


2 Z 2 
caro = PE 6 cos wot y o Len lZ (00 FP (70) 
h n w — wm? 

c. DISCUSSION. OSCILLATOR STRENGTH 
o. The concept of oscillator strength 

We set: 

2m w Z F 
f= 2m tno |< 512] G0 > (71) 


Jo 18 a real dimensionless number, characteristic of the | pọ > <> | , > transition 
and called the oscillator strength* of this transition. If | p, > is the ground state, 
fo 18 positive, like co, y. 

Oscillator strengths satisfy the following sum rule (the Thomas-Reiche-Kuhn 
sum rule): 


Y fo = 1 (72) 


This can be shown as follows. Using (36), we can write: 
1 1 
fo =< Pol Z|O >< Pnl Pel o> = F< Gol Pel n> Pal Z| Po > (73) 


The summation over n can be performed by using the closure relation relative to the { | p, > } 
basis, and we get: 


1 
È Ino = =< Po (ZP, — P.Z)| po > = < Pol Ho > = 1 (74) 


B. The quantum mechanical justification for the elastically bound electron model 


We shall substitute definition (71) into (70) and multiply the expression so 
obtained by the number ./ of atoms contained in a volume whose linear dimensions 
are much smaller than the wavelength 4 of the radiation. The total electric dipole 
moment induced in this volume can then be written: 

2 


q 
(wo — w 


N L DA=} Nino > & cos wt (75) 
n m 


* The operator Z enters into (71) because the incident wave is linearly polarized along Oz. It 
would, however, be possible to give a general definition of the oscillator strength, independent of the 
polarization of the incident wave. 
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Comparing (75) and (65), we see that it is like having -+` classical oscillators 
[since Y 1” fo =V according to (72)] whose natural angular frequencies are 


not all the same since they are equal to the various Bohr angular frequencies of 
the atom associated with the transitions from | py >. According to (75), the pro- 
portion of oscillators with the angular frequency @, IS fro 

Thus, for a non-resonant wave, we have justified the classical model of the 
elastically bound electron. Quantum mechanics gives the frequencies of the various 
oscillators and the proportion of oscillators which have a given frequency. This 
result shows the importance of the concept of oscillator strength and enables 
us to understand a posteriori why the elastically bound electron model was so useful 
in the study of the optical properties of materials. 


3. Resonant excitation. Induced absorption and emission 


a. TRANSITION PROBABILITY ASSOCIATED WITH A MONOCHROMATIC WAVE 


Consider an atom initially in the state | p,> placed in an electromagnetic 
wave whose angular frequency is close to a Bohr angular frequency w p; 

The results of §C-1 of chapter XIII (sinusoidal excitation) are directly appli- 
cable to the calculation of the transition probability %,(t; œ). We find, using 
expression (37) (thus making the electric dipole approximation): 


Lan re ex 2u) |< Pr|Z|o >| 6 Fl — 055) (76) 
where: 
F(t,@ — wy a ome B E + (77) 


We have already discussed the resonant nature of 2 (t; œ) in chapter XIII. 
At resonance, 4 (t; œ) is proportional to 6?, that is, to the incident flux of electro- 
magnetic energy [cf. formula (9)]. 


COMMENTS: 


(i) Ifinstead of using the gauge (27), leading to the matrix element (37), we had used 
the gauge (29) leading to the Hamiltonian (32), the factor (w;;/c0)? in (76) would 
be missing. The fact that the results are different is not at all surprising. The states 
lps > and |@;>, and consequently 4,(t;@) do not have the same physical 
meaning in the two gauges. 

(ii) However, as t— œœ, the diffraction function F(t, œ — wy¡j) tends towards 
ô(w — wp), and the factor (œ riloY? approaches unity. This leads to the same 
probability density A,(t;@) in the two gauges. This result can be easily 
understood if we consider the incident electromagnetic wave to be a quasi- 
monochromatic wave packet of very large but finite spatial extent, rather than a 
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(iii) 


plane wave extending to infinity. When t > + oo the E field “seen” by the atom 
tends towards zero and the gauge transformation associated with the function x 
defined in (28) tends towards unity. Consequently | p; > and | p, > each represent 
the same physical states in the two gauges. 

Obviously, it is also possible to consider the transition probability between two 
well-defined energy states of the atomic system for a finite time interval. In this 
case, the eigenstates | o; > and | p, > of the atomic Hamiltonian Ho written in (12) 
only represent states of well-defined atomic energy (kinetic plus potential) in the 
gauge (29) where A is zero [see (30)] and p?/2m represents the kinetic energy. The 
same physical states would be represented in gauge (27) by the states exp 
[— iqx(r, )/h1|p,> and exp [— igx(r, t)/AR]|p,> respectively. For finite t, 
calculations are therefore simpler in the gauge (29). Since in the rest of this 
complement we replace F(t, œ — wp) by ó(w — wp) [see (79)], we will be 
considering the limit t > œ for which the above difficulties disappear. 


BROAD-LINE EXCITATION. TRANSITION PROBABILITY PER UNIT TIME 


In practice, the radiation which strikes the atom is very often non-mono- 


chromatic. We shall denote by 4 (w) dw the incident flux of electromagnetic energy 
per unit surface within the interval [w, œ + dw]. The variation of .4 (œw) with 
respect to w is shown in figure 2. A is the excitation line width. If 4 is infinite, we say 
that we are dealing with a “white spectrum”. 


The different monochromatic waves which constitute the incident radiation 


are generally incoherent : they have no well-defined phase relation. The total 
transition probability a, can therefore be obtained by summing the transition 
probabilities associated with each of these monochromatic waves. We must, conse- 
quently, replace 6? by 2.4(w) do/syc in (76) [formula (9)] and integrate over w. 


This gives: __ q? 
Pdt) === -|Z|e >)? 
wn 
x | ¢o(22) I(w) F(t, w — wp) (78) 
Ilw) 


FIGURE 2 


The spectral distribution of the incident flux of electromagnetic energy per unit surface. 4 is the 
width of this spectral distribution. 
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We can then proceed as in $C-3 of chapter XIII to evaluate the integral which 
appears in (78). Compared to a function of w whose width is much larger than 47/t, 
the function F(t, w — wy;) (see figure 3 of chapter XIII) behaves like ó(w — wy;). 
If ¢ is large enough to make 4x/1 < A (4: excitation line width) while remaining 
small enough for the perturbation treatment to be valid, we can, in (78), assume that: 


F(t, w — wp) = 2nt d(w — wp) (79) 
which yields: 


2 
a T 
Pit) =— A Cos 1Z | >? Flos) t (80) 
0 


We can write (80) in the form: 
P(t) = Ci, lopi) t (81) 
where: 


4n? 
Ci =X oI Z| ei Po (82) 


and « is the fine-structure constant: 


2 2 
E Sige re ae er 
* S Gn, he he 13 pe) 


This result shows that 2 (t) increases linearly with time. The transition proba- 
bility per unit time W,, is therefore equal to: 


Wis = Cis Slop) (84) 


%W;, is proportional to the value of the incident intensity for the resonance 
frequency œp; to the fine-structure constant a, and to the square of the modulus of 
the matrix element of Z, which is related [by (71)] to the oscillator strength of the 
lo, > <=> |; > transition. 

In this complement, we have considered the case of radiation propagating 
along a given direction with a well-defined polarization state. By averaging the 
coefficients C,, over all propagation directions and over all possible polarization 
states, we could introduce coefficients B,,, analogous to the coefficients C,,, defining 
the transition probabilities per unit time for an atom placed in isotropic radiation. 
The coefficients B,, (and B,;) are none other than the coefficients introduced by 
Einstein to describe the absorption (and induced emission). Thus, we see how 
quantum mechanics enables us to calculate these coefficients. 


COMMENT: 


A third coefficient, A,,, was introduced by Einstein to describe the spontaneous 
emission of a photon which occurs when the atom falls back from the upper state | y 1) 
to the lower state |g; >. The theory presented in this complement does not explain 
spontaneous emission. In the absence of incident radiation, the interaction Hamiltonian 
is zero, and the eigenstates of H, (which is then the total Hamiltonian) are stationary 
states. 
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Actually, the preceding model is insufficient, since it treats asymmetrically the 
atomic system (which is quantized) and the electromagnetic field (which is considered 
classically). When we quantize both systems, we find, even in the absence of incident 
photons, that the coupling between the atom and the electromagnetic field continues to 
have observable effects (a simple interpretation of these effects is given in complement Ky). 
The eigenstates of H, are no longer stationary states, since H, is no longer the Hamiltonian 
of the total system, and we can indeed calculate the probability per unit time of sponta- 
neous emission of a photon. Quantum mechanics therefore also enables us to obtain the 
Einstein coefficient A; 


References and suggestions for further reading: 


See, for example: Schiff (1.18), chap. 11 ; Bethe and Jackiw (1.21). Part II, chaps. 10 
and 11; Bohm (5.1), chap. 18, $812 to 44. 

For the elastically bound electron model: Berkeley 3 (7.1), supplementary 
topic 9; Feynman I (6.3), chap. 31 and Feynman II (7.2), chap. 32. 

For Einstein coefficients: the original article (1.31), Cagnac and Pebay- 
Peyroula (11.2), chap. Ml and chap. XIX, $ 4. 

For the exact definition of oscillator strength: Sobel'man (11.12), chap. 9, $31. 

For atomic multipole radiation and its selection rules: Sobel’man (11.12), chap. 9, 
§ 32. 
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LINEAR AND NON-LINEAR RESPONSES 
OF A TWO-LEVEL SYSTEM SUBJECT 
TO A SINUSOIDAL PERTURBATION 


1. Description of the model 
a. Bloch equations for a system of spin 1/2's interacting with a radiofrequency field 
b. Some exactly and approximately soluble cases 
c. The response of an atomic system 


2. The approximate solution of the Bloch equations for the system 


a. Perturbation equations 
b. The Fourier series expansion of the solution 
c. The general structure of the solution 


3. Discussion 


a. Zeroeth-order solution: competition between pumping and relaxation 
b. First-order solution: the linear response 

c. Second-order solution: absorption and induced emission 

d. Third-order solution: saturation effects and multiple-quanta transitions 


4. Exercises: applications of this complement 


In the preceding complement, we applied first-order time-dependent perturbation 
theory to the treatment of some effects produced by the interaction of an atomic 
system and an electromagnetic wave: appearance of an induced dipole moment. 
induced emission and absorption processes, etc. 

We shall now consider a simple example, in which it is possible to pursue the 
perturbation calculations to higher orders without too many complications. This 
will allow us to demonstrate some interesting “non-linear” effects : saturation 
effects, non-linear susceptibility, the absorption and induced emission of several 
photons, etc. In addition, the model which we shall describe takes into account 
(phenomenologically) the dissipative coupling of the atomic system with its sur- 
roundings (the relaxation process). This will enable us to complete the results 
related to the “linear response” obtained in the preceding complement. For example, 
we shall calculate the atom’s induced dipole moment, not only far from resonance, 
but also at resonance. 

Some of the effects we are going to describe are currently objects of a great 
deal of research. Their study necessitates very strong electromagnetic fields. Only 
recently (since the development of lasers) have we been able to produce such 
fields. New branches of research have thus appeared: quantum electronics, non- 
linear optics, etc. The calculation methods described in this complement (for a very 
simple model) are applicable to these problems. 
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1. Description of the model 


a. BLOCH EQUATIONS FOR A SYSTEM OF SPIN 1/2'S INTERACTING 
WITH A RADIOFREQUENCY FIELD 


We shall return to the system described in $4-a of complement F,y: a system 
of spin 1/2’s placed in a static field B, parallel to Oz, interacting with an oscillating 
radiofrequency field and subject to “pumping” and “relaxation” processes. 

If M (t) is the total magnetization of the spin system contained in the cell 
(fig. 6 of complement F,y), we showed in complement F,, that: 


E Ao) = nm- q MO) + yMlt) x Bit) (1) 


The first term on the right-hand side describes the preparation, or the “pumping” 
of the system: n spins enter the cell per unit time, each one with an elementary 
magnetization p, parallel to Oz. The second term arises from relaxation processes, 
characterized by the average time 7 required for a spin either to leave the cell or 
have its direction changed by collision with the walls. Finally, the last term of (1) 
corresponds to the precession of the spins about the total magnetic field : 


B(1) = Bye, + B, (t) (2) 


B(z) is the sum of a static field Bye, parallel to Oz and a radiofrequency field B, (+) 
of angular frequency w. 


COMMENTS: 


(i) The transitions which we shall study in this complement (which connect the 
two states | + > and | — > of each spin 1/2) are magnetic dipole transitions. 


(ii) One could question our using expression (1) relative to mean values rather 
than the Schródinger equation. We do so because we are studying a statistical 
ensemble of spins coupled to a thermal reservoir (via collisions with the cell 
walls). We cannot describe this ensemble in terms of a state vector : we must 
use a density operator (see complement E;n). The equation of motion of this 
operator is called a “master equation” and we can show that it is exactly 
equivalent to (1) (see complement F,,. $3 and 4, and complement E,,, where 
we show that the mean value of the magnetization determines the density 
matrix of an ensemble of spin 1/2’s). 

Actually, the master equation satisfied by the density operator and the 
Schródinger equation studied in $C-1 of chapter XIII have the same structure 
as (1): a linear differential equation, with constant or sinusoidally varying 
coefficients. The approximation methods we describe in this chapter are, 
therefore, applicable to any of these equations. 
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b. SOME EXACTLY AND APPROXIMATELY SOLUBLE CASES 


If the radiofrequency field B, (1) is rotating, that is, if: 


B,(t) = B, (e, cos wt + e, sin wt) (3) 


equation (1) can be solved exactly [changing to the frame which is rotating with B, 

transforms (1) into a time-independent linear differential system]. The exact solution 

of (1) corresponding to such a situation is given in $4-b of complement F y. 
Here, we shall assume B, to be linearly polarized along Ox: 


B,(t) = B, e, cos wt (4) 


In this case, it is not possible* to find an exact analytic solution of equation (1) 
(there is no transformation equivalent to changing to the rotating frame). We shall 
see, however, that a solution can be found in the form of a power series expansion 
in B,. 


COMMENT: 


The calculations we shall present here for spin |/2’s can also be applied 
to other situations in which we can confine ourselves to two levels of the system 
and ignore all others. We know (cf. complement C,,) that we can associate 
a fictitious spin 1/2 with any two-level system. The problem considered here is 
therefore that of an arbitrary two-level system subject to a sinusoidal pertur- 
bation. 


c. RESPONSE OF AN ATOMIC SYSTEM 


The set of terms which, in M ,„, M ,„ @,, depend on B, constitute the “ response” 
of the atom to the electromagnetic perturbation. They represent the magnetic dipole 
moment induced in the spin system by the radiofrequency field. We shall see that 
such a dipole moment is not necessarily proportional to B, ; the terms in B, represent 
the linear response, and the others (terms in B?, Bi, ...), the “non-linear response”. 
In addition, we shall see that the induced dipole moment does not oscillate only at 
the angular frequency w, but also at its various harmonics pw (p = 0, 2. 3, 4, ...). 


It is easy to see why we should be interested in calculating the response of an atomic 
system. Such a calculation constitutes an important part of the theory of the propagation of an 
electromagnetic wave in a material or of the theory of atomic oscillators, “ masers” or “lasers”. 


* A linearly polarized field results from the superposition of a left and a right circular component. 
It would be possible to find an exact solution for each of these components taken separately. However, 
equation (1) is not linear, in the sense that a solution corresponding to (4) cannot be obtained by super- 
posing two exact solutions. one of which corresponds to (3) and the other one to the field rotating in the 
opposite direction [in the term » MÆ x B which appears on the right-hand side of (1). Æ depends on B, J. 
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Consider an electromagnetic field. Because of the coupling between this field and the 
atomic system, a polarization appears in the material, due to the atomic dipole moments (arrow 
directed towards the right in figure 1). This polarization acts like a source term in Maxwell’s 
equations and contributes to the creation of the electromagnetic field (arrow directed towards 


Response of the atomic system 


Atomic 
dipole 
moments 


Electromagnetic 


field 


Maxwell’s equations 


FIGURE 1 


Schematic representation of the calculations to be performed in studying the propagation of an 
electromagnetic wave in a material (or the operation of an atomic oscillator, a laser or a maser). 
We begin by calculating the dipole moments induced in the material by a given electromagnetic 
field (the response of the atomic system). The corresponding polarization acts like a source term 
in Maxwell’s equations and contributes to the creation of the electromagnetic field. We then take 
the field obtained to be equal to the one with which we started. 


the left in figure 1). When we “close the loop”, that is, when we take the field so created to be 
equal to the one with which we started, we obtain the wave propagation equations in the material 
(refractive index) or the oscillator equations (in the absence of external fields, an electromagnetic 
field may appear in the material, if there is sufficient amplification : the system becomes unstable 
and can oscillate spontaneously). In this complement, we shall be concerned only with the first 
step of the calculation (the atomic response). 


2. The approximate solution of the Bloch equations of the system 
a. PERTURBATION EQUATIONS 


As in complement Fy, we set: 


Wy = — YB, (5) 
w, = — yB, (6) 
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hw, represents the energy difference of the spin states | + > and | — > (fig. 2). 
Substituting (4) into (2), and (2) into (1), we obtain, after a simple calculation : 

d Me. Ow, ; 

dr A: = nko — En +i z COS wt (4. — M,) (7-a) 

d M e Le 

a A = — T. + iwo, F iw, COS wt H, (7-b) 
with: 

M, = M, + iM, (8) 
[+> 
hw, 
FIGURE 2 


Energy levels of a spin 1/2 in a 

static magnetic field B,; vw, is the 

Larmor angular frequency in the 
|-> field B,. 


Note that the source term ny, exists only in the equation of motion of .4/_, since po 
is parallel to Oz, and the pumping is said to be longitudinal*. We also point out 
that the relaxation time can be different for the longitudinal components (.4_) and 
the transverse components (K , ) of the magnetization. For the sake of simplicity, 
we shall choose a single relaxation time here. 

Equations (7-a) and (7-b), called the “Bloch equations”, cannot be solved 
exactly. We shall therefore determine their solution in the form of a power series 
expansion in w,: 


M, = "AL +0, "4, +a VA, 4H tama + (9-a) 
My = OH, to PH, han, OM, +a ot MF, tu (9-b) 


Substituting (9-a) and (9-b) into (7-a) and (7-b), and setting equal the coefficients 
of terms in wł, we obtain the following perturbation equations: 


n=0 
d l 
FT A, = nuo — T Mg. (10-a) 
R 
= Og = — = A, + ivg "4 (10-b) 
dt + Tp + =< 0 + 


* In certain experiments, the pumping is “transverse” (Ho is perpendicular to B,). See exercise | 
at the end of the complement. 
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n#0 
d in) l (n) i (n- 1) (n- 1) 
qi M, = — T A, +5 cos wt [2a — MM, | (11-a) 
R 
d (n) l (n) ; (n) ; (n- 1) 
— M, =— —] "M, + ing "M, F ¡cos wt M, (11-b) 
dt e Tr E = 


b. THE FOURIER SERIES EXPANSION OF THE SOLUTION 


Since the only time-dependent terms on the right-hand side of (10) and (11) 
are sinusoidal, the steady-state solution of (10) and (11) is periodic, of period 2x/w. 
We can expand it in a Fourier series: 


+0 


VA, = Y GM eP” (12-a) 
a a 

oM, = Y ma, eo (12-b) 
p=-xw 


EA, and ‘a , represent the pw Fourier components of the nth-order solution. 
By taking “4 _ real and “4, and “4 _ as complex conjugates of each other, 
we obtain the following reality conditions: 


GA a = Pee Aa (13-a) 
MM, =[ 04, y (13-b) 


Substituting (12-a) and (12-b) into (10) and (11), and setting equal to zero 
the coefficient of each exponential e'?°', we find : 


n=0 

QA, = no Tr 

OM, =0 fp #0 

OM, =0 for all p (14) 
n#0 

(io +.) A, =4 Ve +M — SIM, — | 

l (15-a) 
|o F Wo) tE] MA, = F ine + i] (15-b) 


These algebraic equations can be solved immediately : 


n i n- n- n- n- 

a E A A A 
(ipo +7) (16-a) 

WA, =F l [mia + 0204] (16-b) 


Y ipo F Wo) + A 
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Thus, expressions (16) give the nth-order solution explicitly in terms of the 
(n — l)th-order solution. Since the zeroeth-order solution is known [cf. equa- 
tions (14)], the problem is, in theory, entirely solved. 


c. THE GENERAL STRUCTURE OF THE SOLUTION 


It is possible to arrange the various terms of the expansion of the solution 
in a double-entry table in which the perturbation order n labels the columns and the 
degree p of the harmonic pœw being considered labels the row. To zeroeth-order, 
only ‘Q is different from zero. By iteration, using (16), we can deduce the other 
non-zero higher-order terms (table 1), thus obtaining a “tree-like structure”. The 
following properties can be found directly by recurrence, using (16): 


ye 
(3) ad 
p=3 ger N 
Pas 5 
(2) 
p=2 a A 
A 
(at) oO N a) a 
p=1 ¡AM y, on 
we = 
(0) (2) 
p=0 oM: o y 
Pi aa a ore wo 
p= — 1 -1M N 
ee y 
Se a ed N 
p= -2 - M, 
5 A 
Na a 
p=-3 -3M y 
~ 
n=0 n=1 n=2 n=3 


TABLE | 


Double-entry table indicating the pw Fourier components of the magnetization which are non-zero 
to the nth perturbation order in œ. 


(i) At even perturbation orders, only the longitudinal magnetization is 
modified; at odd orders, only the transverse magnetization. 

(ii) At even perturbation orders, only the even harmonics are involved ; at odd 
orders, only the odd harmonics. 

(iii) For each value of n, the values of p to be retained are n, n — 2, ..., 
=n+2,—n. 
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COMMENT: 


This structure is valid only for a particular polarization of the radio- 
frequency field B,(z) (perpendicular to B,). Analogous tables could be 
constructed for other radiofrequency polarizations. 


3. Discussion 


We shall now interpret the results of this calculation, through third order. 


a. ZEROETH-ORDER SOLUTION: COMPETITION BETWEEN PUMPING 
AND RELAXATION 


According to (14), the only non-zero zeroeth-order component is: 
GA, = Mok (17) 


In the absence of radiofrequency fields, there is therefore only a static longitudinal 
magnetization (p = 0). Since .4_ is proportional to the population difference of the 
states | + > and | — > shown in figure 2 (cf. complement E,,), it can also be said 
that the pumping populates these two states unequally. 

The larger the number of particles entering the cell (the more efficient the 
pumping) and the longer 7p (the slower the relaxation), the larger 94.. The zeroeth- 
order solution (17) therefore describes the dynamic equilibrium resulting from 
competition between the pumping and relaxation processes. 

From now on, in order to simplify the notation, we shall set: 


Me = OM, (18-a) 
le (18-b) 
b. FIRST-ORDER SOLUTION: THE LINEAR RESPONSE 


To first order, only the transverse magnetization Mæ , is different from zero. 
Since Æ, = .@*, it suffices to study æ ,. 


a. Motion of the transverse magnetization 


According to table 1, forn = l, we have p = + 1. Settingn = landp = + | 
in (16-b), using (18), we get: 


Mo l 
22 wy, — 0 + i 
Mo l 


ieS (19-a) 


Dy, = (19-b) 
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Substituting these expressions into (12-b) and then into (9-b), we obtain Æ , to first 
order in w;: 


Mo ei! 7 tor 
a AA PE 
a E E -ot+ik g +o + | (20) 


The point representing .@ , describes the same motion in the complex plane 
as the projection M, of in the plane perpendicular to B,. According to (20), this 
motion results from the superposition of two circular motions with the same angular 
velocity, one of them right circular (the e'"! term) and the other left circular (the 
e" term). The resulting motion, in the general case, is therefore elliptical. 


P. Existence of two resonances 


The right circular motion has a maximum amplitude when w, = w, and the 
left circular motion, when wọ = — œw. M therefore presents two resonances (while 
for a rotating field. there was a single resonance; see complement F,,,). The inter- 
pretation of this phenomenon is as follows: the linear radiofrequency field can be 
broken down into a left and a right circular field, each of which induces a resonance; 
since the rotation directions are opposed, the static fields B, for which these 
resonances appear are opposed. 


Y. Linear susceptibility 


Near a resonance (w, = w, for example), we can neglect the non-resonant 
term in (20). We then get: 
M glo! 
YM = — (21) 


Ma wzo | : 2 Wy — 0 + Hr 
«f , is therefore proportional to the rotating radiofrequency field component in 
the direction corresponding to the resonance, B, e'®'/2 in this case. 

The ratio of .æ , to this component is called the linear susceptibility 7(w): 


1 


xlo) = — == En 


(22) 
x(w) is a complex susceptibility because of the existence of a phase difference 
between M, and the rotating component of the radiofrequency field responsible 
for the resonance. 

The square of the modulus of x(w) has the classical resonant form in the 
neighborhood of w = wọ (fig. 3), over an interval of width: 


2 


do = ip = =F (23) 


The longer the relaxation time Tg, therefore, the sharper the resonance curve. From 
now on, we shall assume that the two resonances wọ = œw and @) = — w are 
completely separated, that is, that: 


oll, = Oh, > | (24) 
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A Ixo) 


y —»> 
0 @ 


FIGURE 3 


Variation of the square of the modulus |x(c)|? of the linear susceptibility of the spin system, 
with respect to œ. A resonance appears, of width 2/7,, centered at w = wo. 


The phase difference varies from 0 to + z when we pass through resonance. 
Itisequalto + 2/2atresonance: itiswhen#, and the rotating component are out of 
phase by 1/2 that the work of the couple exerted by the field on M is maximal. The 
sign of this work depends on the sign of .4 ,, that is, qn that of uo: it depends on 
whether the spin states of the entering particles are | + > or | — >. In one case 
(spins entering in the lower level), the work is furnished by the field, and energy is 
transferred from the field to the spins (absorption). In the opposite case (particles 
entering in the higher level), the work is negative, and energy is transferred from the 
spins to the field (induced emission). The latter situation occurs in atomic amplifiers 
and oscillators (masers and lasers). 


c. SECOND-ORDER SOLUTION: ABSORPTION AND INDUCED EMISSION 


To second order, according to table 1, only 34, and ‚24, are non-zero. 
First, we shall study G4,, that is, the static population difference of the states | +> 
and | — > to second order. We shall then consider 34,, that is, the generation of 
the second harmonic. 


o. Variation of the population difference of the two states of the system 


GM, corrects the zeroeth-order result obtained for (9,4 ,. According to (16-a) 


and (13-b): 
i 
QA, = Gp LA. + APM - VM, — QM] 
i 
= gp 04 Me — GE. — Pa] (25) 
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which, according to first-order solutions (19-a) and (19-b), yields: 


M 
DA, = — | _ + aa (26) 


4 | (wm - OAH +r}  (w +00)? +T1 
Grouping the static terms (p = 0) through second order in (9-a), we get: 


CO 1 
M (static) = Mo + arar Forn 
TR (o + 0) + TR 


(27) 


Figure 4 represents this static longitudinal magnetization as a function of wọ. 


A M (static) 


— 0) 0 w Wo 


FIGURE 4 


Variation of the static longitudinal magnetization with respect to œw. To second order in the 
perturbation treatment, there appear two resonances of width 2/7}, centered at œo = œ and 
wo = — w. The calculation is valid only if the relative intensity of the resonances is small, 
that is, if o, Th < l. 


The population difference is therefore always decreased, to second order, 
relative to its value in the absence of radiofrequency, and the decrease is propor- 
tional to the intensity of the radiofrequency field. This is simple to understand : 
under the effect of the incident field, transitions are induced from | + > to | — > 
(induced emission) or from | — > to | + > (absorption); whatever the sign of the 
initial population difference, the transitions from the more populated state are the 
more numerous, so that they decrease the population difference. 
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The maximum value of w? | G lis Mowi R = WowiTR/4 (the 
resonance amplitude which appears as a dip in figure 4). For the perturbation 
expansion to make sense, it is therefore necessary that: 


wilk <1 (28) 


p. Generation of the second harmonic 
According to (16-a), (13-b), (19-a) and (19-b): 


I 


(2) A PA 
ane. 4(20) = iTp) 


(yx _ (1) 
LM? OM] 


8Lo — ¡Ilo +o — ilk Wo —w til 


CM, describes a vibration of the magnetic dipole along Oz at the angular 
frequency 2w. The system can therefore radiate a wave of angular frequency 2%, 
polarized (as far as the magnetic field is concerned) linearly along Oz. 

Thus, we see that an atomic system is not generally a linear system. It can 
double the excitation frequency, triple it (as we shall see later), etc. The same type 
of phenomenon exists in optics for very high intensities (“non-linear optics”) : 
a red laser beam (produced, for example, by a ruby laser) falling on a material such 
as a quartz crystal can give rise to an ultraviolet light beam (doubled frequency). 


COMMENT: 


It will prove useful to compare |R| and |32@,| in the neighborhood 
of w = œw. According to (29), for w ~ wo, we have: 


PA = Eo (30) 
Similarly, (26) indicates that: 

[GA] = r (31) 
Therefore, for w = wy: 

[Gal te 2 |g, (32) 
[ZA] 4a.  409Tp 


according to (24). 
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d. THIRD-ORDER SOLUTION: SATURATION EFFECTS 
AND MULTIPLE-QUANTA TRANSITIONS 


To third order, table 1 shows that only ¿4 , and ¿314 , are non-zero; it 
suffices to study 94 ,. 

A, corrects to third order the right circular motion of #, , found to first 
order and analyzed in $b above. We shall see that $24, corresponds to a saturation 
effect in the susceptibility of the system. 

GM , represents a new component of angular frequency 3w of the motion 
of M, (generation of the third harmonic). Moreover, the resonant nature of S , 
in the neighborhood of w, = 3w can be interpreted as resulting from the simultaneous 
absorption of three radiofrequency photons, a process which conserves both the 


total energy and the total angular momentum. 


a. Saturation of the susceptibility of the system 
According to (16-b): 


(3) = l 1 (2) (2) 

ME TEA (33) 
Since we are interested in the correction to the right circular motion discussed in 
$ 3-b, which is resonant at w = @,, we shall place ourselves in the neighborhood 
of w =w. We can then, according to the comment in the preceding section 
[cf. formula (32)], neglect 54, compared to Gæ.. Thus we obtain, using expres- 
sion (26) for (514, (neglecting the term whose resonance peak is at œ = — 0): 


My 1 


DA, = — m 
8 0-0 +ik (o—w) +T} 


(34) 


If we regroup results (34) and (19-a), we find the expression for the right 
circular motion of .@, at the frequency w/2z, to third order in w,: 


MW, (right circular) = 


y iwt 2 1 
2 0-0 +ilz 4 (w — wm) +Tk 


Comparing (35) and (21), we see that the susceptibility of the system goes from 
value (22) to the value: 


A A E [ ms ——— | (36) 


Oy — 0 +ilp 4 (mw — w) +13 


It is therefore multiplied by a factor smaller than one; the greater the intensity of 
the radiofrequency field and the nearer we are to resonance, the smaller the factor. 
The system is then said to be “saturated”. The œw? term of (36) is called the “ non- 
linear susceptibility”. 
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The physical meaning of this saturation is very clear. A weak electromagnetic 
field induces in the atomic system a dipole moment which is proportional to it. 
If the field amplitude is increased, the dipole cannot continue to increase propor- 
tionally to the field. The absorption and emission transitions induced by the field 
decrease the population difference of the atomic states involved. Consequently, the 
atomic system responds less and less to the field. Furthermore, we see that the term 
in brackets in (36) is none other than the term which expresses the decrease in the 
population difference to second order [cf. formula (27), in which the term resonant 
at w = — w was neglected ]. 


COMMENT: 


The saturation terms play a very important role in all maser or laser theories. 
Consider figure | again. If we keep only the linear response term in the first step of the 
calculation (arrow directed to the right), the induced dipole moment is proportional to 
the field. If the material amplifies (and if the losses of the electromagnetic cavity are 
sufficiently small), the reaction of the dipole on the field (arrow directed to the left) tends 
to increase the field by a quantity proportional to it. Thus, we obtain for the field a linear 
differential equation which leads to a solution which increases linearly with time. 

It is the saturation terms that prevent this unlimited increase. They lead to an 
equation whose solution remains bounded and approaches a limit which is the steady- 
state laser field in the cavity. Physically, these saturation terms express the fact that 
the atomic system cannot furnish the field with an energy greater than that corresponding 
to the population difference initially introduced by the pumping. 


PB.  Three-photon transitions 
According to (16-b) and (29): 


1 1 (2) 
og — 30o +i 


2 
a PS: a a SA 
16 0-30 +i, 20 — iR| o +0 ihk o-o +ik 
(37) 


With respect to the term (34 ,, we could make the same comment as for u, : 
the atomic system produces harmonics of the excitation frequency (here, the third 
harmonic). 

The difference with the discussion of the preceding section relative to G, 
is the appearance of a resonance centered at wọ = 3w [due to the first resonant 
denominator of (37)]. 

We can give a particle interpretation of the w, = w resonance discussed in the 
preceding sections: the spin goes from the state | — > to the state | + > by 
absorbing a photon (or emitting it, depending on the relative positions of the | + > 
and | — > states). There is resonance when the energy hw of the photon is equal 
to the energy hw, of the atomic transition. We could give an analogous particle 
interpretation of the wọ = 3w resonance. Since hw, = 3hw, the transition neces- 
sarily involves three photons, since the total energy must be conserved. 
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We may wonder why no resonance has appeared to second order for hw, = 2hw 
(two-photon transition). The reason is that the total angular momentum must also 
be conserved during the transition. The linear radiofrequency field is, as we have 
already said, a superposition of two fields rotating in opposite directions. With each 
of these rotating fields are associated photons of a different type. For the right 
circular field, it is 9? photons, transporting an angular momentum + h relative 
to Oz. For the left circular field, it is a” photons, transporting an angular 
momentum — h. To go from the | — > state to the | + > state, the spin must absorb 
an angular momentum + h relative to Oz (the difference between the two eigen- 
values of S,). It can do so by absorbing a o* photon; if wọ = w, there is also conser- 
vation of the total energy, which explains the appearance of the w, = w resonance. 
The system can also acquire an angular momentum + h by absorbing three photons 
(fig. 5) : two o* photons and one a” photon. Therefore, if w = 3w, both energy and 
total angular momentum can be conserved, which explains the wg = 3% resonance. 
On the other hand, two photons can never give the atom an angular momentum + A: 
either both photons are o* and they carry 2h, or they are both a” and they carry — 2h, 
or one is a* and one is a” and they carry no total angular momentum. 


|+> 
A 7 A 
Eo o ho 
Apo { FIGURE 5 
ho, NA ot ho The spin can go from the | — > state to 
( the | + > state by absorbing three photons 
f of energy fico, The total energy is conserved 
T 4 if hw, = 3hw. The angular momentum is 
a gt ho conserved if two photons have a o* pola- 
) rization (each carries an angular momen- 
) tum +h relative to Oz) and the third 
y - Y has a o 7 polarization (it carries an angular 
| -=> momentum — fi). 


These arguments can easily be generalized and enable us to show that reso- 
nances appear when wo = w, 30, 5w, 7w, ..., (2n + 1)o, ..., corresponding to the 
absorption of an odd number of photons. Furthermore, we see from formula (16-b) 
that "tD, gives rise to a resonance peak for wọ = (2n + l)w. Nothing 
analogous occurs at even orders since, according to table 1, we must then use 
equation (16-a}. 


COMMENTS: 


(i) Ifthe field B, is rotating, there is only one type of photon, o* or a”. The same 
argument shows that a single resonance can then occur, at w, = w if the 
photons are o* andatw, = — wif they are o”. This enables us to understand 


1337 


www.elsolucionario.net 


6 COMPLEMENT Byun 


(iii) 
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why the calculations are much simpler for a rotating field and lead to an exact 
solution. It is instructive to apply the method of this complement to the case of 
a rotating field and to show that the perturbation series can be summed to give 
the solution found directly in complement Fy. 


Consider a system having two levels of different parities, subject to the influence 
of an oscillating electric field. The interaction Hamiltonian then has the 
same structure as the one we are studying in this complement : S, has only 
non-diagonal elements. Similarly, the electric dipole Hamiltonian, since it is 
odd, can have no diagonal elements. In the second case, the calculations are 
very similar to the preceding ones and lead to analogous conclusions : 
resonances are found for wy = w, 3w, 5w, ... The interpretation of the “odd” 
nature of the spectrum is then as follows: the electric dipole photons have 
a negative parity, and the system must absorb an odd number of them in 
order to move from one level to another of different parity. 


For the spin 1/2 case, assume that the linear radiofrequency field is neither 
parallel nor perpendicular to B, (fig. 6). B, can then be broken down into 
a component parallel to By, B,,, with which are associated z photons (with 
zero angular momentum relative to Oz), and a component B, ,, with which, 
as we have seen, 0? and o” photons are associated. In this case, the atom 
can increase its angular momentum relative to Oz by + h, and move from | => 


AB, 


FIGURE 6 


The static magnetic field B, and the radiofrequency field B,, 
in the case in which B, is neither parallel nor perpendicular 
to B, B,, and B,, are the components of B, parallel and 
perpendicular to By. 


to | + >, by absorbing two photons, one o* and the other z. It can be shown, 
by applying the method of this complement, that for this polarization of the 
radiofrequency, a complete (even and odd) spectrum of resonances appears : 
Wo = W, 20, 30, 40, ... 
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4. Exercises: applications of this complement 


EXERCISE | 


In equations (1), set w, = 0 (no radiofrequency) and choose po parallel to Ox 
(transverse pumping). 

Calculate the steady-state values of .4,, M, and M,. Show that 4, and M, 
undergo resonant variations when the static field is swept about zero (the Hanle 
effect), Give a physical interpretation of these resonances (pumping in competition 
with Larmor precession) and show that they permit the measurement of the pro- 
duct y Tp. 


EXERCISE 2 


Consider a spin system subject to the same static field B, and to the same 
pumping and relaxation processes as in this complement. These spins are also 
subjected to two linear radiofrequency fields, the first one of angular frequency w 
and amplitude B,, parallel to Oz, and the second one of angular frequency w’ and 
amplitude B;, parallel to Ox. 

Using the general methods described in this complement, calculate the 
magnetization M of the spin system to second order in w, = — yB, and 
w = — yB; (terms in w?, œw, ww). We fix w = — yB, and w,. Assume 
w > w, and let w’ vary. Show that, to this perturbation order, two resonances 
appear, one at w = wọ — w and the other at w’ = wọ + 0. 

Give a physical interpretation of these two resonances (the first one 
corresponds to a two-photon absorption, and the second, to a Raman effect). 


References and suggestions for further reading: see section 15 


Semiclassical theories of masers and lasers: Lamb (15.4) and (15.2), Sargent 
et al. (15.5), chap. VIII, IX and X. 

Non-linear optics: Baldwin (15.19), Bloembergen (15.21), Giordmaine (15.22). 

Iterative solution of the master equation: Bloembergen (15.21), chap. 2, §§3, 4 
and 5 and Appendix II. 

Multiphoton processes in R. F, range, Hanle effect: Brossel's lectures in (15.2). 
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Complement Cy 


OSCILLATIONS OF A SYSTEM BETWEEN 
TWO DISCRETE STATES UNDER THE EFFECT 
OF A SINUSOIDAL RESONANT PERTURBATION 


1. The method: secular approximation 
2. Solution of the system of equations 
3. Discussion 


The approximation method used to calculate the effect of a resonant perturbation 
in chapter XIII is not valid over long periods of time. We have seen [cf. condi- 
tion (C-18) of this chapter] that ¢ must satisfy: 


h 
— 1 


Suppose we want to study the behavior of a system subjected to a resonant 
perturbation over a considerable time [for which condition (1) is not satisfied ]. Since 
the first-order solution is then insufficient, we could try to calculate a certain 
number of higher-order terms to obtain a better expression for 2,(t; w): 


Bolt 0) = [A bY (0) + AMA) + BPE + ...|? (2) 


Such a method would lead to unnecessarily long calculations. 

We shall see here that it is possible to solve the problem more elegantly 
and rapidly by fitting the approximation method to the resonant nature of the 
perturbation. The resonance condition w = œp; implies that only the two discrete 
states |g; > and | p,> are effectively coupled by W(t). Since the system, at the 
initial instant, is in the state | ø; > [b:(0) = 1], the probability amplitude b,(t) of 
finding it in the state | y y > at time ¢ can be appreciable. On the other hand, all the 
coefficients b,(1) (with n # i, f) remain much smaller than | since they do not 
satisfy the resonance condition. This is the basis of the method we shall use. 


1. The method: secular approximation 


In chapter XIII, we replaced all the components b,(:) on the right-hand 
side of (B-11) by their values b,(0) at time ¢ = 0. Here, we shall do the same thing 
for the components for which k # i, f. However, we shall explicitly keep b;(1) 
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and by(1). Thus, in order to determine b,(t) and b,(1), we are led to the system of 
equations [the perturbation having the form (C-1-a) of chap. XIII]: 


int b(t) = xf fies NN dl W, b{t) + [erecta =! eee Wee b(t) \ 


; d l ilw +0 ft Ho ofM iot — iwit 
iħ g PAD =o e tosh — ¿Hee My, bdt) + [e — e JW, bt) } 


(3) 
On the right-hand side of these equations, certain coefficients of b,(t) and 6 ,(1) 
are proportional to e*°~°#", so they oscillate slowly in time when w = wp; 
On the other hand, the coefficients proportional either to e*i% or to e* Motos 
oscillate much more rapidly. Here, we shall use the secular approximation, which 
consists of neglecting the second type of terms. The remaining ones, called “secular 
terms”, are then those whose coefficients reduce to constants for w = Opie When 
integrated over time, they make significant contributions to the variations of the 
components b;(+) and 5,(t). On the other hand, the contribution of the other terms 
is negligible, since their variation is too rapid (the integration of e'* causes a 
factor 1/Q to appear, and the mean value of e*? over a large number of periods is 
practically zero). 


COMMENT: 


For the preceding argument to be valid, 1t is necessary for the temporal 
variation of a term eb. y(t) to be due principally to the exponential, and 
not to the component b; p(t). Since w is very close to w p;, this means that b; ,(1) 
must not vary very much over a time interval of the order of 1/ lori: This is 
indeed true with the assumptions we have made, that is, with W < H,. The 
variations of b;(1) and 5,(f) (which are constants if W = 0) are due to the 
presence of the perturbation W, and are appreciable for times of the order 
of h/|W,,| [this can be verified directly from formulas (8), obtained below]. 
Since by hypothesis |W,,| < A |w,,|, this time is much greater than I/|q,;|. 


In conclusion, the secular approximation leads to the system of equations : 


d l 


kas ee i(o—w fiM ; = 
E bit) = AA (4-2) 
d b (t) = + eto sit W, f b(t) (4-b) 
dt f 2h ee 


whose solution, very close to that of system (3), is easier to calculate, as we shall 
see in the next section. 


2. Solution of the system of equations 


We shall begin by considering the case for which w = w,;. Differentia- 


ting (4-a) and substituting (4-b) into the result, we obtain: 
d? 1 
— bát) = — — |W,]|? b(t 5 
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Since the system is in the state | p; > at time 1 = 0, the initial conditions are: 


b(0) = (6-a) 

b (0) = 0 (6-b) 
which, according to (4), gives: 

Din 

sap O=O (a) 

db W,, 

LS = —fi E 

ar O ==> (7-b) 
The solution of (5) which satisfies (6-a) and (7-a) can be written: 

b{t) = cos (=) (8-a) 


We can then calculate b, from (4-a): 
b(t) = e% sin (E) (8-b) 


where Api is the argument of W,;. The probability A(t; w = wy,) of finding the 
system in the state | y y > at time / is therefore, in this case, equal to: 


Pit; = wp) = sin? (Ee) (9) 


When o is different from w,; (while remaining close to the resonance value), 
the differential system (4) is still exactly soluble. In fact, it is completely analogous 
to the one we obtained in complement F,y [cf. equation (15)] in studying the 
magnetic resonance of a spin 1/2. The same type of calculation as in that complement 
leads to the analogue of relation (27) (Rabi’s formula), which can be written here: 


2 
P(t; w) = ai La sin? pi (o — wy)? ;| (10) 
|W. |? +%%w0 — wy)? 2 


[when w = wy,, this expression does reduce to (9)]. 


3. Discussion 


The discussion of the result obtained in (10) is the same as that of the magnetic 
resonance of a spin 1/2 (cf. complement Fiy, §2-c). The probability 2,(£; œ) is an 
oscillating function of time; for certain values of £, A,(¢; w) = 0, and the system has 
gone back into the initial state | g; >. 

Furthermore, equation (10) measures the magnitude of the resonance pheno- 
menon. When w = wp; however small the perturbation is, it can cause the system 
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to move completely from the state | g; > to the state | p y >*. On the other hand, 
if the perturbation is not resonant, the probability 2,(f; œw) always remains less 
than 1. 

Finally, it is interesting to compare the result obtained in this complement with 
the one obtained using the first-order theory in chapter XIII. First of all, note that, 
for all values of z, the probability 4 ,(1: œ) obtained in (10) is included between 0 
and 1. The approximation method used here therefore enables us to avoid the 
difficulties encountered in chapter XIII (cf. $ C-2-c-B). When we let t approach zero 
in (9), we get (C-17) of this chapter. Thus, first-order perturbation theory is indeed 
valid for ¢ sufficiently small (cf. comment of § B-3-b). It amounts to replacing the 
sinusoid which represents 2,(1; œ) as a function of time by a parabola. 


* The magnitude of the perturbation, characterized by |W. enters, at resonance, only into the 
time taken by the system to move from | p, > to | p, >. The smaller | W,,|, the longer the time. 
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DECAY OF A DISCRETE STATE RESONANTLY 
COUPLED TO A CONTINUUM OF FINAL STATES 


1. Statement of the problem 
2. Description of the model 


a. Assumptions about the unperturbed Hamiltonian Ho 

b. Assumptions about the coupling W 

c. Results of first-order perturbation theory 

d. Integrodifferential equation equivalent to the Schrédinger equation 


3. Short-time approximation. Relation to first-order perturbation theory 


> 


. Another approximation method for solving the Schrödinger equation 
5. Discussion 


a. Lifetime of the discrete state 
b. Shift of the discrete state due to the coupling with the continuum 
c. Energy distribution of the final states 


1. Statement of the problem 


In § C-3 of chapter XIII, we showed that the coupling induced by a constant 
perturbation between an initial discrete state of energy E; and a continuum of final 
states (some of which have an energy equal to E;) causes the system to go from the 
initial state to this continuum of final states. More precisely, the probability of 
finding the system in a well-defined group of states of the continuum at time t 
increases linearly with time. Consequently, the probability %2,(1) of finding the 
system in the initial state | ọ; > at time ¢ must decrease linearly over time from the 
value 2,(0) = 1. It is clear that this result is valid only over short times, since 
extrapolation of the linear decrease of &,(t) to long times would lead to negative 
values of 42;(t), which would be absurd for a probability. This raises the problem 
of determining the behavior of the system after a long time. 

We encountered an analogous problem when we studied the resonant transi- 
tions induced by a sinusoidal perturbation between two discrete states |œ; > 
and | p s >» First-order perturbation theory predicts a decrease proportional to 1? 
of Z (t) from the initial value Z,(0) = 1. The method presented in complement Cyin 
shows that the system actually oscillates between the states | p; > and | g, >. The 
decrease with 1? found in $C of chapter XIII merely represents the “beginning” 
of the corresponding sinusoid. 

We might expect to find an analogous result in the problem with which we 
are concerned here (oscillations of the system between the discrete state and the 
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continuum). We shall show that this is not the case: the physical system leaves the 
state | p; > irreversibly. We find an exponential decrease e~"' for Z(t) (for which 
the perturbation treatment gives only the short-time behavior 1 — It). Thus, the 
continuous nature of the set of final states causes the reversibility found in 
complement C,,,, to disappear; it is responsible for a decay of the initial state, 
which thus acquires a finite lifetime (unstable state; cf. complement Ky). 

The situation envisaged in this complement is very frequently encountered in 
physics. For example, a system, initially in a discrete state, can split, under the effect 
of an internal coupling (described, consequently, by a time-independent Hamil- 
tonian W), into two distinct parts whose energies (kinetic in the case of material 
particles and electromagnetic in the case of photons) can have. theoretically, any 
value; this gives the set of final states a continuous nature. Thus, in a-decay, a 
nucleus which is initially in a discrete state is transformed (via the tunnel effect) 
into a system composed of an a-particle and another nucleus. A many-electron 
atom A which is initially in a configuration (cf. complement A, y and By) in 
which several electrons are excited can, under the effect of electrostatic interactions 
between electrons, give rise to a system formed of an ion 4* and a free electron 
(the energy of the initial configuration must, of course, be greater than the simple 
ionization limit of A) : this is the “autoionization” phenomenon. We can also cite 
the spontaneous emission of a photon by an excited atomic (or nuclear) state : the 
interaction of the atom with the quantized electromagnetic field couples the discrete 
initial state (the excited atom in the absence of photons) with a continuum of final 
states (the atom in a lower state in the presence of a photon of arbitrary direction, 
polarization and energy). Finally, we can mention the photoelectric effect, in which 
a perturbation, now sinusoidal, couples a discrete state of an atom A toa continuum 
of final states (the ion 4* and the photoelectron e”). 

These few examples of unstable states taken from various domains of physics 
are sufficient to indicate the importance of the problem we are treating in this 
complement. 


2. Description of the model 


a. ASSUMPTIONS ABOUT THE UNPERTURBED HAMILTONIAN H, 


To simplify the calculations as much as possible, we shall make the following 
assumptions about the spectrum of the unperturbed Hamiltonian Ho. This spectrum 
includes: 


(i) a discrete state | p, > of energy E, (non-degenerate) : 


H,|%;> = E; | 9;> (1) 
(ii) a set of states | a > which form a continuum: 
Hola» = Eja» (2) 


E can take on a continuous infinity of values, distributed over a portion of the real 
axis including E;. We shall assume, for example, that E varies from 0 to + œ: 


E>0 (3) 
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Each state | æ > is characterized by its energy E and a set of other parameters 
which we shall denote by £ (as in $ C-3-a-B of chapter XIII). | « > can therefore also 
be written in the form | f, E >. We have [cf. formula (C-28) of chap. XIII] : 


da = p(f, E) df dE (4) 
where p(f$, E) is the density of final states. 


The eigenstates of H, satisfy the following relations (orthogonality and closure 
relations) : 


<p lp) =1 (5-a) 
<g;|%) =0 (5-b) 
<a | a’ > = ó(a — a) (5-c) 
je> co] + anar Cal = (6) 
b. ASSUMPTIONS ABOUT THE COUPLING W 


We shall assume that W is not explicitly time-dependent and has no diagonal 
elements: 


{9;|W|o,>=<a|Wla>=0 (7) 


(if these diagonal elements were not zero, we could always add them to those of Ho, 
which would simply amount to changing the unperturbed energies). Similarly, we 
shall assume that W cannot couple two states of the continuum: 


<a|Wla’>=0 (8) 


The only non-zero matrix elements of W are then those which connect the discrete 
state | P; X with the states of the continuum. It is these matrix elements, < a | W | Pi’), 
which are responsible for the decay of the state | p; >. 


The preceding assumptions are not too restrictive. In particular, condition (8) is very 
often satisfied in the physical problems alluded to at the end of §1. The advantage of this model 
is that it enables us to investigate the physics of the decay phenomenon without too many compli- 
cated calculations. The essential physical conclusions would not be modified by using a more 
elaborate model. 


Before taking up the new method of solving the Schrödinger equation which 
we are describing in this complement, we shall indicate the results of the first-order 
perturbation theory of chapter XIII as they apply to this model. 


c. RESULTS OF FIRST-ORDER PERTURBATION THEORY 


The discussion of §C-3 of chapter XIII enables us to calculate [using, in 
particular, formula (C-36)] the probability of finding the physical system at time t 
(initially in the state | g; >) in a final state of arbitrary energy belonging to a group 
of final states characterized by the interval 68, around the value f,. 
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Here, we shall concern ourselves with the probability of finding the system 
in any of the final states |a >: neither E nor $ is specified. We must therefore 
integrate expression (C-36) of chapter XIII with respect to f, which gives the 
probability density [the integration over the energy was already performed 
in (C-36)]. Thus, we introduce the constant : 


The desired probability is then equal to F +. With the assumptions of § a, it represents 
the probability of the system's having left the state | P; > at time £. If we call 42 (1) 
the probability that the system is still in this state at time ¢, we have: 


Pt) =1 Fi (10) 

In the discussion of the following sections, it is important to recall the 
validity conditions for (10): 

(i) Expression (10) results from a first-order perturbation theory which is valid 
only if 2;(t) differs only slightly from its initial value 2,(0) = 1. We then must 
have: 


(11) 


| 


SS 


(ii) Furthermore, (10) is valid only for sufficiently long times t. 

To state the second condition more precisely, and to see, in particular, if it is 
compatible with (11), we return to expression (C-31) of chapter XIII ($ and E are 
no longer constrained to vary only inside the intervals ôf, and 6E,). Instead of 
proceeding as we did in chapter XIII, we shall integrate the probability density 
appearing in (C-31), first over 6 and then over E. The following integral then 
appears : 


0 E — E; 
al aE F(t ; ) KE) (12) 


where K(E), which results from the first integration over $, is given by: 


K(E) = faske E! W | p; >|? p(B, E) (13) 


h 
at E = E, and of width 4xh/+. 
Let hA be the “width” of K(£) : hA represents the order of magnitude of the E 
variation needed for K(E) to change significantly (cf. fig. 1). As soon as ¢ is suffi- 
ciently large that: 


F 6 = =) is the diffraction function defined by (C-7) of chapter XIII, centered 


(14) 
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> 
E 


FIGURE | 


E — 
Variation of the functions K(E) and F (. ji 


E, 
) with respect to E. The respective “widths” 


of the two curves are of the order of fA and 4xh/1. For sufficiently large /, F (. E ; =!) behaves 


like a “delta function” with respect to K(E). 


h 
tion (C-32) of chapter XIII, we can then write (12) in the form: 


F 6 = 2) behaves like a “delta function” with respect to K(£). Using rela- 


2nt 


ah [azote — E) K(E) == K(E = E) = Ti (15) 


F t 
since by comparing (9) and (13), it can easily be seen that: 
— K(E = E) =T (16) 


Again we find that the linear decrease appearing in (10) is valid only if ¢ is 
large enough to satisfy (14). 
Conditions (11) and (14), obviously, are compatible only if: 


A>T (17) 
We have thus given a quantitative form to the condition stated in the note on 


page 1300. In the rest of this complement, we shall assume that inequality (17) is 
satisfied. 
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d. INTEGRODIFFERENTIAL EQUATION EQUIVALENT 
TO THE SCHRODINGER EQUATION 


It is easy to adapt expressions (B-11) of chapter XIII to the case we are 
studying here. 

The state of the system at time ¢ can be expanded on the { | P; Y), 
basis: 


a>} 


| W(t) > = bd) e Pp) + E b(a, 1) EP] a > (18) 


When we substitute state vector (18) into the Schródinger equation, using the 
assumptions stated in $$ 2-a and 2-b, we obtain, after a calculation that is analogous 
to the one in $ B-I of chapter XIII, the following equations of motion: 


in b(t) = [ox oP y, | W | a > bla, t) (a 
iti bla, 1) = PENN < a | W | o; > bilt) (20) 


The problem consists of using these rigorous equations to predict the behavior of 
the system after a long time, taking into account the initial conditions: 


b(0) = 1 (21-a) 
bla, 0) = 0 (21-b) 


The simplifying assumptions which we made for W imply that © b(t) depends 


only on b(a, t}, and < b(a, t), only on 4,(t). Consequently, we can integrate equa- 
tion (20), taking initial condition (21-b) into account. Substituting the value obtained 
in this way for b(a, 1) into (19), we obtain the following equation describing the 
evolution of b (1): 

d 


=, Pat) = -faf dr ef E EM INC | W |p; >|? bAt’) (22) 
0 


By using (4) and performing the integration over f, we obtain, using (13): 


© b(t) = -5| dE | dt! K(E) eE Pe- bi’) (23) 
dt h? 0 0 


Thus, we have been able to obtain an equation involving only b, However, 
it must be noted that this equation is no longer a differential equation, but an 


integrodifferential equation: a b,(t) depends on the entire “history of the system 


between the times 0 and f. 
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Equation (23) is rigorously equivalent to the Schródinger equation. We do not 
know how to solve it exactly. In the following sections, we shall describe two 
approximate methods for solving this equation. One of them ($3) is equivalent to 
the first-order theory of chapter XIII; the other one ($4) enables us to study the 
long-time behavior of the system more satisfactorily. 


3. Short-time approximation. 
Relation to first-order perturbation theory 


If £ is not too large, that is, if b,(t) is not too different from 6,(0) = 1, we can 
replace b,(+') by b,(0) = 1 on the right-hand side of (23). This right-hand side then 
reduces to a double integral, over E and f’, whose integration presents no difficulties : 


x t 
| ae | ar ki esaeen a 
0 0 


We shall perform this calculation explicitly, since it allows us to introduce two 
constants [one of which is I’, defined by (9)] which play an important role in the more 
elaborate method described in $4. 

We shall begin by integrating over /' in (24). According to (47) of appendix II, 
the limit of this integral for + —> x is the Fourier transform of the Heaviside 
step function. More precisely : 


t 
: P 1 
Lim | eF Pu dr = ħ| nò(E; — E) +i 
Lam | T TÓO(E; — E) + i? EE (25) 


(we have set £ — 1” = 1). 

Actually, it is not necessary to let ¢ approach infinity in order to use (25) in the 
calculation of (24). It suffices for h/z to be very much smaller than the “width” AA 
of K(E), that is, for ¢ to be very much greater than 1/4. We again find the validity 
condition (14). If this condition is satisfied, we can then use (25) to write (24) in the 
form: 


K(E = E) -19 | ° LO dE (26) 


The first term of (26) is, according to (16), simply — T'/2. We shall set: 
ae ” K(E) 
SE = P | r dE (27) 
0 
Therefore, the double integral (24) is equal to: 
12 (28) 
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When b,(1") is replaced by 6,(0) = | in (23), this equation then becomes 
[as soon as (14) is satisfied] : 


d FOE 


rr ec eee ge (29) 
The solution of (29), using the initial condition (21-a), is very simple : 
r ÒE 
b(t) = 1 — G y oa (30) 
Obviously, this result is valid only if |b,(z)| differs only slightly from 1, that is, if: 
1 h 
tF > SE (31) 


This is the other validity condition, (11), for first-order perturbation theory. 

Using (30), we can easily calculate the probability 2,(1) = |b,(r)|? that the 
system is still in the state |g; > at time £. If we neglect terms in 1"? and dE’, we 
obtain: 


Ali) =1 Ti (32) 


All the results obtained in chapter XIII can then be deduced from equation (23) when 
b(t’) is replaced by 5,(0). This equation has also enabled us to introduce the param- 
eter E, whose physical significance will be discussed later [note that dF does not 
appear in the treatment of chapter XIII because we were concerned only with the 
calculation of the probability |b, D|. and not with that of the probability ampli- 
tude b,(f)]. 


4. Another approximate method 
for solving the Schrödinger equation 


A better approximation consists of replacing b(t) by b;(t) rather than by 5,(0) 
in (23). To see this, we shall begin by doing the integral over E which appears on the 
right-hand side of the rigorous equation, (23). We thus obtain a function of E; and 
ie 


g(E,,.t —t’)= - | dE K(E) ei Duo (33) 
o 


which is clearly different from zero only if £ — £' is very small. In (33), we are 
integrating over E the product of K(£), which varies slowly with E (cf. fig. 1), and an 
exponential whose period with respect to the variable E is 2zħ/(t — 1"). If we choose 
values of ¢ and /' such that this period is very much smaller than the width hA of K(£), 
the product of these two functions undergoes numerous oscillations when £ is varied, 
and its integral over E is negligible. Consequently, the modulus of g(£;, t — 1’) is 
large for £ — £' = O and becomes negligible as soon as £ — t > 1/4. This property 
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means that, for all z, the only values of 5, (t’) to enter significantly into the right-hand 
side of (23) are those which correspond to f' very close to 1(1 — t S 1/4). Indeed, 
once the integration over E has been performed, this right-hand side becomes : 


[ses — 1) bt) de (34) 


and we see that the presence of g(E,, t — 1”) practically eliminates the contribution 
of b(t) as soon as t — 1" > 1/4. 


E | ; 
Thus, the derivative ai b(t) has only a very short memory of the previous 


values of b,(t) between 0 and +. Actually, it depends only on the values of b; at times 
immediately before t, and this is true for all t. This property enables us to transform 
the integrodifferential equation (23) into a differential equation. If b,(t) varies very 
little over a time interval of the order of 1/4, we make only a small error by replacing 
b(t’) by b;(t) in (34). This yields: 


bdo | a. — dt = — (5 + iF) bt) (35) 


[to write the right-hand side of (35), we used the fact that the integral over ¢’ of 
g(E,, t — f') is simply, according to (33), the double integral (24) evaluated in $ 3 
above |. 

Now, according to the results of §3 (and as we shall see later), the time scale 
characteristic of the evolution of b;(t) is of the order of I/F or AjdE. The validity 
condition for (35) is then: 

r, = <A (36) 
which we have already assumed to be fulfilled [cf. (17)]. 

To a good approximation, and for all t, equation (23) can therefore be written: 


d f 
qr Pl) = — G + 5) b{t) (37) 


whose solution, using (21-a), is obvious: 


b(t) = e Fr e EVA (38) 
It can easily be shown that the limited expansion of (38) gives (30) to first order in F 
and OE. 

COMMENT: 


No upper bound has been imposed on ¢. On the other hand, the integral 


f g(E,, t — t') dt’ which appears in (35) is equal to — (['/2 + idE/h) only if 
0 
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t > 1/4, as we saw in $ 3 above. For very short times, the theory presented here 
suffers from the same limitations as perturbation theory; however, it has the 
great advantage of being valid for long times. 


If we now substitute expression (38) for b,(£) into equation (20), we obtain 
a very simple equation which enables us to determine the probability amplitude 
b(a, t) associated with the state | æ >: 


bla) == Ca] Wo) | E A (39) 
0 


that is: 
| L a- Tt/2 ACE Ei- Eih 
A L LAL OE aed eal so 


Equations (38) and (40), respectively, describe the decay of the initial state and 
the “filling” of the final states | a >. Now let us study in greater detail the physical 
content of these two equations. 


5. Discussion 


a. LIFETIME OF THE DISCRETE STATE 
According to (38), we have: 


P(t) = lb (1)? = e7" (41) 


FIGURE 2 


Variation with respect to time of the proba- 
bility of finding the system in the discrete 
state | y, > at time :. We obtain an exponential 


a tenis - —- —> decrease, e”, for which Fermi’s golden rule 
0 c= 1 t gives the tangent at the origin (this tangent is 
r represented by a dashed line). 


4, (1) therefore decreases irreversibly from 4;(0) = 1 and approaches zero as! —> x 
(fig. 2). The discrete initial state is said to have a finite lifetime 7; t is the time constant 
of the exponential of figure 2: 


E 
E 
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This irreversible behavior contrasts sharply with the oscillations of the system 
(Rabi’s formula) between two discrete states when it is subject to a resonant pertur- 
bation coupling these two states. 


b. SHIFT OF THE DISCRETE STATE DUE TO THE COUPLING 
WITH THE CONTINUUM 


If we go from b;(t) to c,(t) [ cf. formula (B-8) of chapter XIII], we obtain, 
from (38): 


c(t) = e Tu? e HE¿+8EJ /h (43) 
Recall that, in the absence of the coupling W, we would have: 
c(t) = e Eat (44) 


In addition to the exponential decrease, e” !'?, the coupling with the continuum is 
therefore responsible for a shift in the discrete state energy, which goes from E, 
to E; + ôE. This is the interpretation of the quantity 9£ introduced in $ 3. 

Let us analyze expression (27) for dE more closely. Substituting definition (13) 
of K(E) into (27), we get: 


=o] Faz | 66 AWB. CBE lo (45) 


or, if we use (4) and replace < $, E | by <a |: 


The contribution to this integral of a particular state | x > of the continuum, 
for which E # E,, is: 


(a Wig, >)? (47) 


E,—E 


We recognize (47) as a familiar expression in stationary pe-turbation theory 
[cf. formula (B-14) of chapter XI]. (47) represents the energy shift of the state | pp, > 
due to the coupling with the state |x >, to second order in W. 6£ is simply the sum 
of the shifts due to the various states | a > of the continuum. We might imagine that 
a problem would appear for the states | a > for which E = E,. Actually, the presence 
in (46) of the principal part 42 implies that the contribution of the states | æ > situated 
immediately above | p; > compensates that of the states situated immediately below. 
Summing up: 


(i) The coupling of | o, > with the states | a > of the same energy is responsible 
for the finite lifetime of | p, > [the function ó(£, — E) of formula (25) enters into 
the expression for T ]. 
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(ii) The coupling of | p, > with the states | a. > of different energies is responsible 
for an energy shift of the state | œ; >. This shift can be calculated by stationary 
perturbation theory (this was not obvious in advance). 


COMMENT: 


In the particular case of the spontaneous emission of a photon by an atom, 
JE represents the shift of the atomic level under study due to the coupling with the 
continuum of final states (an atom in another discrete state, in the presence of a photon). 
The difference between the shifts dE of the 2s,,, and 2p,,, states of the hydrogen atom 
is the “ Lamb shift” [cf. complement Ky, $ 3-d-8 and chapter XII, $ C-3-b, comment (iv)]. 


c. ENERGY DISTRIBUTION OF THE FINAL STATES 


Once the discrete state has decayed, that is, when 1 > 1/F, the final state of the 
system belongs to the continuum of states | æ >. It is interesting to study the energy 
distribution of the possible final states. For example, in the spontaneous emission 
of a photon by an atom, this energy distribution is that of the photon emitted when 
the atom falls back from the excited level to a lower level (the natural width of 
spectral lines). 

When ¢ > l/r, the exponential which appears in the numerator of (40) is 
practically zero. We then have: 


1 


bla, tl? ~ a!|W|o.> 2 _ a 
IA a a aria 


t> F 


(48) 


|b(a, £)? actually represents a probability density. The probability of finding the 


system, after the decay, in a group of final states characterized by the intervals df, 
and dE, about f, and E, can be calculated directly from (48): 


dA(B p Es, t) 


1 


= >]? n dE, (49 
|< Bp Es |W | 9; >| Plr E) E L raja A Es ( ) 


Let us examine the E,-dependence of the probability density : 


dP (Py, Es, t) 
dB, dE, 


Since |< B,, E, | W | 9; >|* p(B, Ey) remains practically constant when £, varies 
over an interval of the order of AI’, the variation of the probability density with 
respect to E, is essentially determined by the function: 


| 


(E, — E, — SEP + 1?T?/4 (79) 


and has, consequently, the form shown in figure 3. The energy distribution of the 
final states has a maximum for E, = E, + OE, that is, when the final state energy 
is equal to that of the initial state | p, >, corrected by the shift SE. 
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A dB, Ep t) 
dB, dE, 


alo 


A SS, 


> 
E, 


© 
m 
+ 
> 
ty 


FIGURE 3 


Form of the energy distribution of the final states attained by the system after the decay of the 
discrete state. We obtain a Lorentzian distribution centered at E, + JE (the energy of the discrete 
state corrected by the shift 6£ due to the coupling with the continuum). The shorter the lifetime z of 
the discrete state, the wider the distribution (time-energy uncertainty relation). 


The form of the distribution is that of a Lorentz curve of width AI, called the 
“natural width” of the state | P; X. An energy dispersion of the final states there- 
fore appears. The larger AI (that is, the shorter the lifetime t = I/F of the 
discrete state), the greater the dispersion. More precisely : 
AE, =hI = i (51) 
T 


Note again the analogy between (51) and the time-energy uncertainty relation. 
In the presence of the coupling W, the state | p; > can be observed only during 
a finite time, of the order of its lifetime t. When we want to determine its energy 
by measuring that of the final state of the system, the uncertainty AE of the result 
cannot be much less than h/t. 


References: 


The original article: Weisskopf and Wigner (2.33). 
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EXERCISES 


1. Consider a one-dimensional harmonic oscillator of mass m, angular fre- 
quency w, and charge q. Let | p, > and E, = (n + 1/2)how, be the eigenstates and 
eigenvalues of its Hamiltonian H.. 

For t < 0, the oscillator is in the ground state | pg >. At £ = 0, it is subjected 
to an electric field “pulse” of duration t. The corresponding perturbation can be 
written : 


— g&X fi < t< 
wit) = që or 0 T 
0 fort <0and 1 > 1 


& is the field amplitude and X is the position observable. Let A, be the probability 
of finding the oscillator in the state | p, > after the pulse. 


a. Calculate 2%, by using first-order time-dependent perturbation theory. 
How does &, vary with qt, for fixed wg? 

b. Show that, to obtain &,, the time-dependent perturbation theory calcu- 
lation must be pursued at least to second order. Calculate %,, to this perturbation 
order. 

c. Give the exact expressions for &, and &, in which the translation operator 
used in complement F, appears explicitly. By making a limited power series 
expansion in & of these expressions, find the results of the preceding questions. 


2. Consider two spin 1/2’s, S, and S,, coupled by an interaction of the form 
a(t)S, .S,: a(t) is a function of time which approaches zero when |r| approaches 
infinity, and takes on non-negligible values (on the order of a, ) only inside an interval, 
whose width is of the order of t, about ¢ = 0. 


a. Att = — œ, the system is in the state | +, — > (an eigenstate of S,, and S,, 
with the eigenvalues + h/2 and — h/2). Calculate, without approximations, the 
state of the system at £ = + œ. Show that the probability P(+ — —» — +) of 
finding, at £ = + 00, the system in the state | —, + > depends only on the integral 


| l a(t) dt. 


b. Calculate P(+ — —» — +) by using first-order time-dependent pertur- 
bation theory. Discuss the validity conditions for such an approximation by 
comparing the results obtained with those of the preceding question. 


c. Now assume that the two spins are also interacting with a static magnetic 
field B, parallel to Oz. The corresponding Zeeman Hamiltonian can be written : 


Ho = — Boly1S¡. + V2S22) 


where y, and y, are the gyromagnetic ratios of the two spins, assumed to be different. 
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Assume that a(t) = a¿e'**. Calculate P(+ — —» — +) by first-order 
time-dependent perturbation theory. With fixed a, and 7, discuss the variation of 
P(+ — — — +) with respect to Bo. 


3. Two-photon transitions between non-equidistant levels 


Consider an atomic level of angular momentum J = 1, subject to static electric 
and magnetic fields, both parallel to Oz. It can be shown that three non-equidistant 
energy levels are then obtained. The eigenstates | Qu > of A (M = — 1,90, + 1), 
of energies E, correspond to them. We set E, — Eo = hw ,, Eo — E_ 1 = hwo (Wo H 04). 

The atom is also subjected to a radiofrequency field rotating at the angular 
frequency w in the xOy plane. The corresponding perturbation W(r) can be 
written : 

Os 


W(t) =3 Y, e7 it + J el") 


where @, is a constant proportional to the amplitude of the rotating field. 


a. We set (notation identical to that of chapter XIII): 
+1 , 
IWA = Y bult) e Fm] py > 
M=-1 


Write the system of differential equations satisfied by the 6,,(r). 


b. Assume that, at time z = 0, the system is in the state | p_, >. Show that 
if we want to calculate b(t) by time-dependent perturbation theory, the calculation 
must be pursued to second order. Calculate b, (1) to this perturbation order. 


c. For fixed 1, how does the probability 2_, ,,(1) = |b (£)? of finding 
the system in the state | p, > at time z vary with respect to œw? Show that a resonance 
appears, not only for w = w, and w = wọ, but also for œw = (Wy + (0,2. Give 
a particle interpretation of this resonance. 


4. Returning to exercise 5 of complement H,, and using its notation, assume 
that the field B, is oscillating at angular frequency w, and can be written 
B,(t) = B, cos wt. Assume that b = 2a and that w is not equal to any Bohr angular 
frequency of the system (non-resonant excitation). 

Introduce the susceptibility tensor y, of components y;;(w), defined by: 


< M; X(t) = J Re [z lo) Boj ent] 


with i, j = x, y, z. Using a method analogous to the one in $ 2 of complement Axm 
calculate x; (œ). Setting œ = 0, find the results of exercise 5 of complement Hy;. 
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5. The Autler-Townes effect 


Consider a three-level system : | p, >, | p, >. and | p, >, of energies E,, E, 
and E,. Assume E, > E, > E, and E, — E, S E, — £,. 

This system interacts with a magnetic field oscillating at the angular 
frequency w. The states | p, > and | p, > are assumed to have the same parity, 
which is the opposite of that of | Q, >, so that the interaction Hamiltonian W (t) 
with the oscillating magnetic field can connect | p, > and | p, > to | p, >. Assume 
that, in the basis of the three states | Pı >, | @2 >, | @3 >, arranged in that order, 
W (1) is represented by the matrix: 


0 0 0 
0 0 w, Sin wt | h 
0 w, sin wf 0 


where œw, is a constant proportional to the amplitude of the oscillating field. 
a. Set (notation identical to that of chapter XIII): 


vit) > = Y b(t) eE] @, > 


i=1 


Write the system of differential equations satisfied by the b,(r). 


b. Assume that w is very close to w»,, = (E, — E,)/h. Making approximations 
analogous to those used in complement Cxi integrate the preceding system, 
with the initial conditions: 


b,(0) = 6,0) =—~  by(0)=0 


V2 
(neglect, on the right-hand side of the differential equations, the terms whose 
coefficients, e?“ * 0x1 vary very rapidly, and keep only those whose coefficients 
are constant or vary very slowly, as e? 07 03211), 
c. The component D_ along Oz of the electric dipole moment of the system 


is represented, in the basis of the three states | p, >, | o, >, | pz >. arranged in that 
order, by the matrix: 


0d 0 
d 0 0 
0 0 0 


where d is a real constant (D, is an odd operator and can connect only states of 
different parities). 

Calculate < D, >(t) = < W(t) | D, | W(t) >, using the vector | y(t) > calculated 
in b. 

Show that the time evolution of < D, >(r) is given by a superposition of 
sinusoidal terms. Determine the frequencies v, and relative intensities 7, of these 
terms. 
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These are the frequencies that can be absorbed by the atom when it is placed 
in an oscillating electric field parallel to Oz. Describe the modifications of this 
absorption spectrum when, for œw fixed and equal to w33, w, is increased from zero. 
Show that the presence of the magnetic field oscillating at the frequency w,,/2z 
splits the electric dipole absorption line at the frequency w,,/2z, and that the 
separation of the two components of the doublet is proportional to the oscillating 
magnetic field amplitude (the Autler-Townes doublet). 

What happens when, for w, fixed, œw — w3, is varied ? 


6. Elastic scattering by a particle in a bound state. Form factor 


Consider a particle (a) in a bound state | p, > described by the wave func- 
tion ~,(r,) localized about a point O. Towards this particle (a) is directed a beam 
of particles (b), of mass m, momentum fk,, energy E, = h*k?/2m and wave 


function ek» Each particle (b) of the beam interacts with particle (a). The 


nm) 
corresponding potential energy, W, depends only on the relative position r, — r, of 
the two particles. 


a. Calculate the matrix element: 
<a: pob: k, | W(R, — R,) |a: 9:6 =k, > 


of W (R, — R,) between two states in which particle (a) is in the same state | p, > 
and particle (b) goes from the state |k; > to the state |k y >. The expression for 
this matrix element should include the Fourier transform W (k) of the potential 
Wir, — r,): 


eo eee 
Wir, —¥r,) = aan | 70 e- ya) dak 


b. Consider the scattering processes in which, under the effect of the inter- 
action W, particle (b) is scattered in a certain direction, with particle (a) remaining 
in the same quantum state | gy, > after the scattering process (elastic scattering). 

Using a method analogous to the one in chapter XIII [cf. comment (ii) 
of § C-3-b ], calculate, in the Born approximation, the elastic scattering cross section 
of particle (b) by particle (a) in the state | po >. 

Show that this cross section can be obtained by multiplying the cross section 
for scattering by the potential W(r) (in the Born approximation) by a factor which 
characterizes the state | p, >, called the “form factor”. 
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7. A simple model of the photoelectric effect 


Consider, in a one-dimensional problem, a particle of mass m, placed in a 
potential of the form V(x) = — a6(x), where a is a real positive constant. 

Recall (cf. exercises 2 and 3 of complement K) that, in such a potential, there 
is a single bound state, of negative energy E, = — ma?/2h?, associated with a 


Ps ae ma 
normalized wave function p(x) = y majh? e “21. For each positive value of the 


energy E = h*k*/2m, on the other hand, there are two stationary wave functions, 
corresponding, respectively, to an incident particle coming from the left or from the 
right. The expression for the first eigenfunction, for example, is: 


Mes gr — aE | for x <0 
vem 1 + ih?k/ma 
+2 
= ah kma jis forx>0 
2n! + ih°k/ma 


x(x) = 


a. Show that the y,(x) satisfy the orthonormalization relation (in the extended 
sense): 


Cel ter? = 0(k — k') 


The following relation [cf formula (47) of appendix IL] can be used: 


0 x 
. ; . 1 
| e'* dx = | e “* dx = Lim - 
ae ô e>0 & + iq 


: l 
=x 0(q) — 9(7) 


Calculate the density of states p(E) for a positive energy E. 


b. Calculate the matrix element < x, | X | Po > of the position observable X 
between the bound state | p, > and the positive energy state | y, > whose wave 
function was given above. 


c. The particle, assumed to be charged (charge q) interacts with an electric 
field oscillating at the angular frequency œ. The corresponding perturbation is: 


W(t) = — q6 X sin wt 


where @ is a constant. 

The particle is initially in the bound state | p, >. Assume that hw > — Ep. 
Calculate, using the results of § C of chapter XIII [see, in particular, formula (C-37) ], 
the transition probability w per unit time to an arbitrary positive energy state (the 
photoelectric or photoionization effect). How does w vary with w and & ? 
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8. Disorientation of an atomic level due to collisions 
with rare gas atoms 


Consider a motionless atom 4 at the origin of a coordinate frame Oxyz (see 
figure). This atom Aisin a level of angular momentum J = 1, to which correspond the 
three orthonormal kets | M>(M = — 1.0, + 1), eigenstates of J, of eigenvalues Mh. 


A second atom B, in a level of zero angular momentum, is in uniform 
rectilinear motion in the xOz plane: it is travelling at the velocity v along a straight 
line parallel to Oz and situated at a distance b from this axis (b is the “impact 
parameter”). The time origin is chosen at the time when B arrives at point H of the Ox 
axis (OH = b). At time t, atom B is therefore at point M, where HM = vt. Call 0 
the angle between Oz and OM. 

The preceding model, which treats the external degrees of freedom of the 
two atoms classically, permits the simple calculation of the effect on the internal 
degrees of freedom of atom A (which are treated quantum mechanically) of a 
collision with atom B (which is, for example, a rare gas atom in the ground state). 
It can be shown that, because of the Van der Waals forces (cf. complement C,,) 
between the two atoms, atom 4 is subject to a perturbation W acting on its internal 
degrees of freedom, and given by: 

w=£y 

r 
where C is a constant, r is the distance between the two atoms, and J, is the component 
of the angular momentum J of atom 4 on the OM axis joining the two atoms. 


a. Express W in terms of C, b, v, t, J, J4 = J, + iJ,. Introduce the dimen- 
sionless parameter t = vt/b. 


b. Assume that there is no external magnetic field, so that the three states 
| + 1), |0>, | — 1> of atom A have the same energy. 

Before the collision, that is, at + = — œ, atom A is in the state | — 1». 
Using first-order time-dependent perturbation theory, calculate the probability 
P_, ,, of finding, after the collision (that is, att = + 00), atom A in the state | +1). 
Discuss the variation of 2_, ,, with respect to b and v. Similarly, calculate 2_, o- 
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c. Now assume that there is a static field B, parallel to Oz, so that the three 
states | M > have an additional energy Mh», (the Zeeman effect), where wọ is the 
Larmor angular frequency in the field Bo. 

æ. With ordinary magnetic fields (Bọ ~ 10? gauss), wa = 10° rad sec” !: 
b is of the order of SÁ. and v, of the order of 5 x 10? m. sec” '. Show that, under 
these conditions, the results of question b remain valid. 


B- Without going into detailed calculations, explain what happens for much 
higher values of B,. Starting with what value of wọ (where b and v have the values 
indicated in a) will the results of b no longer be valid ? 


d. Without going into detailed calculations, explain how to calculate the 
disorientation probabilities 7_, ,, and 2_, for an atom A placed in a gas of 
atoms B in thermodynamic equilibrium at the temperature 7, containing a number n 
of atoms per unit volume sufficiently small that only binary collisions need be 
considered. 


+ Ax 


d 5 
N.B. We give : | eae 
_, (+13 16 


9. Transition probability per unit time under the effect 
of a random perturbation. Simple relaxation model 


A physical system, subject to a perturbation W(t), is at time 1 = O in the 
eigenstate | p, > of its Hamiltonian Ho. Let Z, (t) be the probability of finding the 
system at time ¢ in another eigenstate of Ho, | Øp >. The transition probability per 

ae . : d 
unit time w;,s(£) is defined by w,y(1) = F Pt). 


a. Show that, to first order im perturbation theory, we have: 


w; (1) = =| = W(t) Wilt = t)dt + cc. (1) 
0 


with hw; = E, — E, (notation identical to that of chapter XIII). 


b. Consider a very large number . t` of systems (A). which are identical and 
without mutual interactions (A = 1, 2. .... .1'). Each of them has a different 
microscopic environment and, consequently, “sees” a different perturbation W“(z). 
It is. of course, impossible to know with certainty each of the individual pertur- 
bations W")(7); we can specify only statistical averages such as: 


Y WO WAU — x) (2) 
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This perturbation is said to be “random”. 
This random perturbation is called stationary if the preceding averages do 
not depend on the time f. The unperturbed Hamiltonian H, is then redefined so 


as to make all the W,, zero, and we set: 


9 plz) = W(t) W(t =t) (3) 


gy(t) is called the “correlation function” of the perturbation (for the pair of 
states | p; >, | py >). gy; (1) generally goes to zero for t > 1,,is a characteristic time, 
called the “correlation time” of the perturbation. The perturbation has a “ memory ” 
which extends into the past only over an interval of the order of 7.. 


a. The W systems are all in the state | y; > at time ¢ = 0 and are subject 
to a stationary random perturbation, whose correlation function is g,,(t) and 
whose correlation time is 7, (M can be considered to be infinite in the calculations). 

Calculate the proportion 2,,(t) of systems which go into the state | py > per 
unit time. Show that after a certain value £, of t, to be specified, z;p(ż) no longer 
depends on 1. 


p. For fixed t,, how does z,, vary with w,,’? Consider the case for which 
grlt) = |v |? e7, with vp; constant. 


y. The preceding theory is rigorously valid only for t < 7, [since formula (1) 
results from a perturbation theory |]. What is the order of magnitude of £, ? Taking 
t, > t,, find the condition for introducing a transition probability per unit time 
which is independent of ¢ [use the form of g ,;(t) given in the preceding question ]. 
Would it be possible to extend the preceding theory beyond f = t, ? 


c. Application to a simple system. 

The VW systems under consideration are WV spin 1/2 particles, with gyro- 
magnetic ratio y, placed in a static field B, (set w, = — yB,). These particles are 
enclosed in a spherical cell of radius R. Each of them bounces constantly back 
and forth between the walls. The mean time between two collisions of the same 
particle with the wall is called the “flight time” 7,. During this time, the particle 
“sees” only the field By. In a collision with the wall, each particle remains adsorbed 
on the surface during a mean time T, (t, < 1,.), during which it “sees”, in addition 
to B,, a constant microscopic magnetic field b, due to the paramagnetic impurities 
contained in the wall. The direction of b varies randomly from one collision to 
another; the mean amplitude of b is denoted by b,. 


a. What is the correlation time of the perturbation seen by the spins? 
Give the physical justification for the following form, to be chosen for the correlation 
function of the components of the microscopic field b: 


babe = 2) = 3 bbe (a) 


v 


and analogous expressions for the components along Oy and Oz, all the cross 


terms b,(1)b,(t — 1)... being zero. 
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P. Let . 4%, be the component along the Oz axis defined by the field B, of the 
macroscopic magnetization of the .W particles. Show that, under the effect of the 
collisions with the wall, .@, “relaxes”, with a time constant 7,: 


(7, is called the longitudinal relaxation time). Calculate 7, in terms of y, Bo. 
Ti te Do: 

y. Show that studying the variation of 7, with B, permits the experimental 
determination of the mean adsorption time 7,. 

6. We have at our disposition several cells, of different radii R, constructed 
from the same material. By measuring 7,, how can we determine experimentally the 
mean amplitude bh, of the microscopic field at the wall ? 


10. Absorption of radiation 
by a many-particle system forming a bound state. 
The Doppler effect. Recoil energy. The Móssbauer effect 


In complement Axi We consider the absorption of radiation by a charged 
particle attracted by a fixed center O (the hydrogen atom model for which the 
nucleus is infinitely heavy). In this exercise, we treat a more realistic situation, in 
which the incident radiation is absorbed by a system of many particles of finite 
masses interacting with each other and forming a bound state. Thus, we are 
studying the effect on the absorption phenomenon of the degrees of freedom of the 
center of mass of the system. 


l. ABSORPTION OF RADIATION BY A FREE HYDROGEN ATOM. 
THE DOPPLER EFFECT. RECOIL ENERGY 


Let R, and P,, R, and P, be the position and momentum observables of two 
particles, (1) and (2), of masses m, and m, and opposite charges q, and q, (a hydrogen 
atom). Let R and P, R¿and P¿ be the position and momentum observables of the 
relative particle and the center of mass (cf. chap. VH, $ B) M = m, + m, is the 
total mass, and m = m,m,/(m, + m,) is the reduced mass. The Hamiltonian Ho 
of the system can be written: 


H= H, + H, (1) 


where: 


l > 


is the translational kinetic energy of the atom, assumed to be free (“external” 
degrees of freedom), and where /7; (which depends only on R and P) describes the 
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internal energy of the atom (“internal” degrees of freedom). We denote by |K > 


the eigenstates of H,, with eigenvalues h?K?/2M. We concern: ourselves with only 
two eigenstates of H;, | x, > and | y, >. of energies E, and E, (E, > E,). We set: 


E, — E, = hing (3) 


u. What energy must be furnished to the atom to move it from the state | K ; z, > 
(the atom in the state | z, > with a total momentum AK) to the state | K’: 7, >? 


b. This atom interacts with a plane electromagnetic wave of wave vector k 
and angular frequency « = ck, polarized along the unit vector e perpendicular 
to k. The corresponding vector potential A(r. 1) is: 

A(r. t) = estro 4 ec, (4) 


where Y, is a constant. The principal term of the interaction Hamiltonian between 
this plane wave and the two-particle system can be written (cf. complement Axu 
$1-b): 


wi) =- Y P, A(R, 1) (5) 


i=1 i 


Express W(t) in terms of R. P. Ro, P¿. im. M and q (setting q, = — q2 = qh 
and show that. in the electric upole approximation which consists of neglecting 
k . R (but not k . Rg) compared to 1. we have: 


W(t) E We io + Wie: (6) 
where: 


q o 
m 


W = — e. P clk Ra (7) 


c. Show that the matrix element of W between the state | K; 7, > and the 
state | K’: y, > is different from zero only if there exists a certain relation between K. 
k, K’ (to be specified). Interpret this relation in terms of the total momentum conser- 
vation during the absorption of an incident photon by the atom. 


d. Show from this that if the atom in the state | K; z, > is placed in the plane 
wave (4), resonance occurs when the energy hw of the photons associated with the 
incident wave differs from the energy hi», of the atomic transition | 7,5 —> | z, > 
by a quantity ò which is to be expressed in terms of Å, wo. K. k, M, c (since ô is 
a corrective term, we can replace w by cw, in the expression for ò). Show that ò is 
the sum of two terms, one of which, ò|. depends on K and on the angle between K 
and k (the Doppler effect), and the other, 6,, is independent of K. Give a physical 
interpretation of 6, and ò, (showing that ò, is the recoil kinetic energy of the atom 
when, having been initially motionless, it absorbs a resonant photon). 

Show that ò, is negligible compared to 6, when hw, is of the order of 10 eV 
(the domain of atomic physics). Choose, for M, a mass of the order of that of the 
proton (Mc? = 10° eV), and, for |K], a value corresponding to a thermal velocity 
at T = 300 °K. Would this still be true if fio, were of the order of 10° eV (the domain 
of nuclear physics)? 
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TA RECOILLESS ABSORPTION OF RADIATION BY A NUCLEUS VIBRATING 
ABOUT ITS EQUILIBRIUM POSITION IN A CRYSTAL. 
THE MÓSSBAUER EFFECT 


The system under consideration is now a nucleus of mass M vibrating at the 
angular frequency Q about its equilibrium position in a crystalline lattice (the 
Einstein model; cf. complement Ay, $ 2). We again denote by R, and P, the position 
and momentum of the center of mass of this nucleus. The vibrational energy of the 
nucleus is described by the Hamiltonian: 


1 1 
H, = are + 5 M(XG + Y% + Z2) (8) 


which is that of a three-dimensional isotropic harmonic oscillator. Denote by 
| Wr .ny.n, » the eigenstate of H, of eigenvalue (2, + n, + n, + 3/2)hQ. In addition 
to these external degrees of freedom, the nucleus possesses internal degrees of 
freedom with which are associated observables which all commute with R¿ and P,. 
Let H, be the Hamiltonian which describes the internal energy of the nucleus. 
As above, we concern ourselves with two eigenstates of H,, | x, > and | x, >. of 
energies E, and £,, and we set hwg = E, — E, Since hw, falls into the y-ray domain, 
we have, of course: 


Wy > Q (9) 


e. What energy must be furnished to the nucleus to allow it to go from the 
state | Yo oo: Xa > (the nucleus in the vibrational state defined by the quantum 
numbers n, = 0, 7, = 0, n, = 0 and the internal state | y, >) to the state | Yn y 0: Z, >? 


f. This nucleus is placed in an electromagnetic wave of the type defined by (4), 
whose wave vector k is parallel to Ox. It can be shown that, in the electric dipole 
approximation, the interaction Hamiltonian of the nucleus with this plane wave 
(responsible for the absorption of the y-rays) can be written as in (6), with: 


W = 4,5 ¡(k) e*s (10) 


where S,(k) is an operator which acts on the internal degrees of freedom and 
consequently commutes with Rg and Pg. Set s(k) = < xy | Silk) | za >. 

The nucleus is initially in the state | Y, o 0; Xa ». Show that, under the influence 
of the incident plane Wave, a resonance appears whenever hw coincides with one of 
the energies calculated in e, with the intensity of the corresponding resonance 
proportional to |s(k)|? [K Y, 0... 1e** | Wo.0.0 >? where the value of k is to be 
specified. Show, furthermore, that condition (9) allows us to replace k by ky = wo/c 
in the expression for the intensity of the resonance. 


g. We set: 
Tako) = |< pp | E% | Po >|? (11) 


where the states | p, > are the eigenstates of a one-dimensional harmonic oscillator 
of position Xç, mass M and angular frequency Q. 
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a. Calculate 2,(k,) in terms of h, M, Q, ky, n (see also exercise 7 of 


2,2 
complement My). Set € = 5 a 


| hQ. Hint: establish a recurrence relation between 


< p, | e*s | po > and < g,_, lete | po >, and express all the x,(k,) in terms 
of zo(ko), which is to be calculated directly from the wave function of the harmonic 
oscillator ground state. Show that the z,(k,) are given by a Poisson distribution. 


b. Verify that Y m (ko = 1. 
n=0 


y. Show that Y nñQr,(k0) = R%w/2Me?. 


n=0 


h. Assume that hQ > h?w2/2Mc?, that is, that the vibrational energy of the 
nucleus is much greater than the recoil energy (very rigid crystalline bonds). Show 
that the absorption spectrum of the nucleus is essentially composed of a single line 
of angular frequency wy. This line is called the recoilless absorption line. Justify 
this name. Why does the Doppler effect disappear? 


i. Now assume that AQ < h?w2/2Mc? (very weak crystalline bonds). Show 
that the absorption spectrum of the nucleus is composed of a very large number of 
equidistant lines whose barycenter (obtained by weighting the abscissa of each line 
by its relative intensity) coincides with the position of the absorption line of the free 
and initially motionless nucleus. What is the order of magnitude of the width of 
this spectrum (the dispersion of the lines about their barycenter)? Show that one 
obtains the results of the first part in the limit Q —~ 0. 


Exercise 3 : 


References : see Brossel’s lectures in (15.2). 


Exercise 5 : 


References : see Townes and Schawlow (12.10), chap. 10, § 9. 


Exercise 6 : 


References : see Wilson (16.34). 


Exercise 9 : 


References : see Abragam (14.1), chap. VIII: Slichter (14.2), chap. $. 


Exercise 10 : 


References : see De Benedetti (16.23); Valentin (16.1), annexe XV. 
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3. Identical particles in quantum mechanics: the difficulties of 
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B. PERMUTATION OPERATORS 1. Two-particle systems 
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b. Application to systems of two identical particles 
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4. Application of the other postulates 
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of physical predictions 
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b. Situations in which the symmetrization postulate can be 
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In chapter III, we stated the postulates of non-relativistic quantum mechanics. and 
in chapter IX, we concentrated on those which concern spin degrees of freedom. 
Here, we shall see (§A) that, in reality, these postulates are not sufficient when we are 
dealing with systems containing many identical particles since, in this case, their 
application leads to ambiguities in the physical predictions. To eliminate these 
ambiguities, it is necessary to introduce a new postulate, concerning the quantum 
mechanical description of systems of identical particles. We shall state this postulate 
in $C and discuss its physical implications in $D. Before we do so, however, we 
shall (in$B) define and study permutation operators, which considerably facilitate 
the reasoning and the calculations. 


A. STATEMENT OF THE PROBLEM 


1. Identical particles: definition 


Two particles are said to be identical if all their intrinsic properties (mass, spin, 
charge, etc.) are exactly the same : no experiment can distinguish one from the other. 
Thus, all the electrons in the universe are identical, as are all the protons and all the 
hydrogen atoms. On the other hand, an electron and a positron are not identical, 
since, although they have the same mass and the same spin, they have different 
electrical charges. 

An important consequence can be deduced from this definition: when 
a physical system contains two identical particles, there is no change in its properties 
or its evolution if the roles of these two particles are exchanged. 


COMMENT: 


Note that this definition is independent of the experimental conditions. 
Even if, in a given experiment, the charges of the particles are not measured, 
an electron and a positron can never be treated like identical particles. 


1371 


www.elsolucionario.net 


CHAPTER XIV SYSTEMS OF IDENTICAL PARTICLES 


2. Identical particles in classical mechanics 


In classical mechanics, the presence of identical particles in a system poses 
no particular problems. This special case is treated just like the general case. 
Each particle moves along a well-defined trajectory, which enables us to distinguish 
it from the others and “follow” it throughout the evolution of the system. 

To treat this point in greater detail, we shall consider a system of two identical 
particles. At the initial time £,, the physical state of the system is defined by specifying 
the position and velocity of each of the two particles; we denote this initial data 
by [ ro, Vo and { r6, Vo }. To describe this physical state and calculate its evolution, 
we number the two particles : r, (+) and v, (+) denote the position and velocity of 
particle (1) at time £, and r,(t) and v,(1), those of particle (2). This numbering has no 
physical foundation, as it would if we were dealing with two particles having 
different natures. It follows that the initial physical state which we have just defined 
may, in theory, be described by two different “mathematical states” as we can set, 
either : 


rifto) = Fo ra (t0) = To 

Vilto) = Yo Valto) = Vo (A-1) 
or: 

r,(1,) = Fo r>(10) = Fo 

Vi (lo) = Yo Valto) = Vo (A-2) 


Now, let us consider the evolution of the system. Suppose that the solution of 
the equations of motion defined by initial conditions (A-1) can be written: 


r(t) =r(1) r(t) = 5 (0) (A-3) 


where r(1) and r'(£) are two vector functions. The fact that the two particles are 
identical implies that the system is not changed if they exchange roles. Consequently, 
the Lagrangian X(r,, v,; r,, v,) and the classical Hamiltonian #(r,, p,; r2, p,) are 
invariant under exchange of indices | and 2. It follows that the solution of the 
equations of motion corresponding to the initial state (A-2) is: 


ri(t)=r(t) r(t) =r(0) (A-4) 


where the functfons r(t) and r‘(t) are the same as in (A-3). 

The two possible mathematical descriptions of the physical state under consi- 
deration are therefore perfectly equivalent, since they lead to the same physical 
predictions. The particle which started from { rọ, Vo } at £, is at r(t) with the velocity 
v(t) = dr/dt at time r, and the one which started from { rọ, vg } is at r'(1) with the 
velocity v'(t) = dr'/d: (fig. 1). Under these conditions, all we need to do is choose, 
at the initial time, either one of the two possible “mathematical states” and ignore 
the existence of the other one. Thus, we treat the system as if the two particles were 
actually of different natures. The numbers (1) and (2), with which we label them 
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arbitrarily at fy, then act like intrinsic properties to distinguish the two particles. 
Since we can follow each particle step-by-step along its trajectory (arrows in figure 1), 
we can determine the locations of the particle numbered (1) and the one numbered (2) 
at any time. 


{to +% E ONO): 


l Eo a Wo } —______> dee), vio} 


Initial state State at the instant ¢ 
FIGURE | 


Position and velocity of each of the two particles at the initial time /, and at time /. 


3. Identical particles in quantum mechanics: 
the difficulties of applying the general postulates 


a. QUALITATIVE DISCUSSION OF A FIRST SIMPLE EXAMPLE 


It is immediately apparent that the situation is radically different in quantum 
mechanics, since the particles no longer have definite trajectories. Even if, at to. 
the wave packets associated with two identical particles are completely separated in 
space, their subsequent evolution may mix them. We then “lose track” of the 
particles: when we detect one particle in a region of space in which both of them 
have a non-zero position probability, we have no way of knowing if the particle 
detected is the one numbered (1) or the one numbered (2). Except in special cases 
— for example, when the two wave packets never overlap — the numbering of the 
two particles becomes ambiguous when their positions are measured, since, as we 
shall see, there exist several distinct “paths” taking the system from its initial state 
to the state found in the measurement. 

To investigate this point in greater detail, consider a concrete example: 
a collision between two identical particles in their center of mass frame (fig. 2). 
Before the collision, we have two completely separate wave packets, directed towards 
each other (fig. 2-a). We can agree, for example, to denote by (1) the particle on the 
left and by (2), the one on the right. During the collision (fig. 2-b), the two wave 
packets overlap. After the collision, the region of space in which the probability 
density of the two particles is non-zero* looks like a spherical shell whose radius 
increases over time (fig. 2-c). Suppose that a detector placed in the direction which 
makes an angle @ with the initial velocity of wave packet (1) detects a particle. It is then 
certain (because momentum is conserved in the collision) that the other particle is 
moving away in the opposite direction. However, itis impossible to know if the particle 
detected at D is the one initially numbered (1) or the one numbered (2). Thus, there 


* The two-particle wave function depends on six variables (the components of the two particles 
coordinates r and r’) and is not easily represented in 3 dimensions. Figure 2 is therefore very schematic: 
the grey regions are those to which both r and r' must belong for the wave function to take on significant 
values. 
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a b c 


FIGURE 2 


Collision between two identical particles in the center of mass frame : schematic representation 
of the probability density of the two particles. 

Before the collision (fig. a), the two wave packets are clearly separated and can be labeled. 
During the collision (fig. b), the two wave packets overlap. After the collision (fig. c), the probability 
density is non-zero in a region shaped like a spherical shell whose radius increases over time. 
Because the two particles are identical, it is impossible, when a particle is detected at D, to 
know with which wave packet, (1) or (2), it was associated before the collision. 


are two different “paths” that could have led the system from the initial state shown 
in figure 2-a to the final state found in the measurement. These two paths are repre- 
sented schematically in figures 3-a and 3-b. Nothing enables us to determine which 
one was actually followed. 


A fundamental difficulty then arises in quantum mechanics when using the 


postulates of chapter III. In order to calculate the probability of a given measurement 
result it is necessary to know the final state vectors associated with this result. 
Here, there are two, which correspond respectively to figures 3a and 3b. These 
two kets are distinct (and, furthermore, orthogonal). Nevertheless, they are 
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FIGURE 3 


Schematic representation of two types of “paths” which the system could have followed in going 
from the initial state to the state found in the measurement. Because the two particles are 
identical, we cannot determine the path that was actually followed. 
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associated with a single physical state since it is impossible to imagine a more 
complete measurement that would permit distinguishing between them. Under these 
conditions, should one calculate the probability using path 3a, path 3b or both? 
In the latter case, should one take the sum of the probabilities associated with each 
path, or the sum of their probability amplitudes (and in this case, with what sign)? 
These different possibilities lead, as we shall verify later, to different predictions. 

The answer to the preceding questions will be given in $D after we have stated 
the symmetrization postulate. Before going on, we shall study another example 
which will aid us in understanding the difficulties related to the indistinguishability 
of two particles. 


b. ORIGIN OF THE DIFFICULTIES: EXCHANGE DEGENERACY 


In the preceding example, we considered two wave packets which, initially, 
did not overlap, which enabled us to label each of them arbitrarily with a number. 
(1) or (2). Ambiguities appeared, however, when we tried to determine the mathe- 
matical state (or ket) associated with a given result of a position measurement. 
Actually, the same difficulty arises in the choice of the mathematical ket used to 
describe the initial physical state. This type of difficulty is related to the concept of 
“exchange degeneracy” which we shall introduce in this section. To simplify the 
reasoning, we shall first consider a different example, so as to confine ourselves to 
a finite-dimensional space. Then, we shall generalize the concept of exchange 
degeneracy, showing that it can be generalized to all quantum mechanical systems 
containing identical particles. 


a. Exchange degeneracy for a system of two spin 1/2 particles 


Let us consider a system composed of two identical spin 1/2 particles, 
confining ourselves to the study of its spin degrees of freedom. As in $A-2, we shall 
distinguish between the physical state of the system and its mathematical description 
(a ket in state space). 

It would seem natural to suppose that, if we made a complete measurement 
of each of the two spins, we would then know the physical state of the total system 
perfectly. Here. we shall assume that the component along Oz of one of them is 
equal to + h/2 and that of the other one, — A /2 (this is the equivalent for the two 
spins of the specification of { ro, vo } and { r6, vo } in §A-2). 

To describe the system mathematically, we number the particles: S, and S, 
denote the two spin observables, and { | £;i. £, > | (where e, and e, can be equal 
to + or —) is the orthonormal basis of the state space formed by the common eigen- 
kets of S,. (eigenvalue ¢,h/2) and S,. (eigenvalue ¢,h/2). 

Just as in classical mechanics, two different “mathematical states” could be 
associated with the same physical state. Either one of the two orthogonal kets : 


ja = ha =P (A-5-a) 
|e ==.t1= +5 (A-5-b) 
can, a priori, describe the physical state considered here. 


These two kets span a two-dimensional subspace whose normalized vectors 
are of the form: 


tae Pek (A-6) 
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with: 


lal? + |B)? = 1 (A-7) 


By the superposition principle, all mathematical kets (A-6) can represent the same 
physical state as (A-S-a) or (A-5-b) (one spin pointing up and the other one pointing 
down). This is called “exchange degeneracy”. 

Exchange degeneracy creates fundamental difficulties, since application of the 
postulates of chapter III to the various kets (A-6) can lead to physical predictions 
which depend on the ket chosen. Let us determine, for example, the probability of 
finding the components of the two spins along Ox both equal to + h/2. With this 
measurement result is associated a single ket of the state space. According to 
formula (A-20) of chapter IV, this ket can be written: 


a 


v2 


fee eine eee eee ae 


v2 


=> + +> +], +> +| +, -> 1 (48) 


Consequently, the desired probability, for the vector (A-6), is equal to: 


2 


1 
se +0) (A-9) 


This probability does depend on the coefficients a and $. It is not possible, therefore, 
to describe the physical state under consideration by the set of kets (A-6) or by any 
one of them chosen arbitrarily. The exchange degeneracy must be removed. That is, 
we must indicate unambiguously which of the kets (A-6) is to be used. 


COMMENT: 


In this example, exchange degeneracy appears only in the initial state, 
since we chose the same value for the components of the two spins in the final 
state. In the general case (for example, if the measurement result corresponds 
to two different eigenvalues of S,.), exchange degeneracy appears in both the 
initial and the final state. 


B. Generalization 


The difficulties related to exchange degeneracy arise in the study of all systems 
containing an arbitrary number N of identical particles (N > 1). 

Consider, for example, a three-particle system. With each of the three particles, 
taken separately, are associated a state space and observables acting in this space. 
Thus, we are led to number the particles: £(1), (2) and &(3) will denote the three 
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one-particle state spaces, and the corresponding observables will be labeled by the 
same indices. The state space of the three-particle system is the tensor product: 


£ =£(1)9 6(2) Q 8(3) (A-10) 


Now, consider an observable B(1), initially defined in &(1). We shall assume 
that B(1) alone constitutes a C.S.C.O. in &(1) [or that B(1) actually denotes several 
observables which form a C.S.C.O. ]. The fact that the three particles are identical 
implies that the observables B(2) and B(3) exist and that they constitute C.S.C.O.’s 
in &(2) and €(3) respectively. B(1), B(2) and B(3) have the same spectrum, 
{b,;n = 1, 2,... ). Using the bases which define these three observables in &(1), 
8 (2) and 8 (3), we can construct, by taking the tensor product, an orthonormal basis 
of &, which we shall denote by: 


{|1:b;2:b33:b,);hj k= 1,2... } (A-11) 


The kets | 1 : b;; 2 : b,; 3 : b, > are common eigenvectors of the extensions of B(1), 
B(2) and B(3) in 6, with respective eigenvalues b;, b; and b,. 

Since the three particles are identical, we cannot measure B(1) or B(2) or B(3), 
since the numbering has no physical significance. However, we can measure the 
physical quantity B for each of the three particles. Suppose that such a measurement 
has resulted in three different eigenvalues, b,, b, and b,. Exchange degeneracy then 
appears, since the state of the system after this measurement can, a priori, be 
represented by any one of the kets of the subspace of & spanned by the six basis 
vectors: 


Therefore, a complete measurement on each of the particles does not permit the 
determination of a unique ket of the state space of the system. 


COMMENT. 


The indeterminacy due to exchange degeneracy is, of course, less 
important if two of the eigenvalues found in the measurement are equal. This 
indeterminacy disappears in the special case in which the three results are 
identical. 


B. PERMUTATION OPERATORS 


Before stating the additional postulate which enables us to remove the inde- 
terminacy related to exchange degeneracy, we shall study certain operators, defined 
in the total state space of the system under consideration, which actually permute 
the various particles of the system. The use of these permutation operators will 
simplify the calculations and reasoning in §§ C and D. 
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1. Two-particle systems 


a. DEFINITION OF THE PERMUTATION OPERATOR P), 


Consider a system composed of two particles with the same spin s. Here it is 
not necessary for these two particles to be identical; it is sufficient that their individual 
state spaces be isomorphic. Therefore, to avoid the problems which arise when 
the two particles are identical, we shall assume that they are not: the numbers (1) 
and (2) with which they are labeled indicate their natures. For example, (1) will 
denote a proton and (2), an electron. 

We choose a basis, { | u; > }, in the state space &(1) of particle (1). Since the 
two particles have the same spin, &(2) is isomorphic to &(1), and it can be spanned 
by the same basis. By taking the tensor product, we construct, in the state space & of 
the system, the basis: 

Ca 2 tae} (B-1) 
Since the order of the vectors is of no importance in a tensor product, we have: 

ERASE AS Se (B-2) 
However, note that: 

|Llzu2:0, >| 1:0,:2:0,> if iff (B-3) 

The permutation operator P,, is then defined as the linear operator whose 
action on the basis vectors is given by: 

Py lata = lia = [lees (B-4) 


Its action on any ket of & can easily be obtained by expanding this ket on the 
basis (B-1)*. 


COMMENT: 


If we choose the basis formed by the common eigenstates of the position 
observable R and the spin component S,, (B-4) can be written: 


Pallina2:ir, } = li ardor es (B-5) 


Any ket | y > of the state space & can be represented by a set of (2s + 1)? functions of 
six variables: 


Y=} [era p er) dire 2: 0) (B-6) 
with: 
Y, (rr) =< lin 062: re [wd (B-7) 


* It can easily be shown that the operator P,, so defined does not depend on the { | u, > | basis 
chosen. 
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We then have: 


Pu le? =, [er Pra, Are) [ lin ves 2:ne> (B-8) 


By changing the names of the dummy variables: 


Ec E 


r <>r (B-9) 


we transform formula (B-8) into: 
Pr Iv>= Y fer ary, (rir) [lines 2: re> (B-10) 


Consequently, the functions: 
wire’) = Olives 2:9 e | Py |w> (B-11) 


which represent the ket |y > = P,, |W) can be obtained from the functions (B-7) 
which represent the ket | y > by inverting (r, e) and (r', £’): 


Wielt, e) = w(t’ r) (B-12) 


b. PROPERTIES OF P,, 
We see directly from definition (B-4) that: 
(Pa)? =1 (B-13) 


The operator P,, is its own inverse. 
It can easily be shown that P,, ts Hermitian: 


P3, = Pry (B-14) 


The matrix elements of P,, in the {| 1 :u,; 2 : u; > } basis are: 


Cliupi2iuy | Poy | lays 2 iu; = Cl ius 2 sup | liuj: u> 
25.8, (B-15) 


Those of P!, are, by definition: 


Cl supe 2 ou, | Pt |1 tes a) = Ta Py | i 2 ae >) 
= (1:42: 14,2 24,3) 
Dd (B-16) 


Each of the matrix elements of P}, is therefore equal to the corresponding matrix element of P,,. 
This leads to relation (B-14). 


It follows from (B-13) and (B-14) that P,, is also unitary: 
P} Pai = P,P}, =| (B-17) 
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c. SYMMETRIC AND ANTISYMMETRIC KETS. 
SYMMETRIZER AND ANTISYMMETRIZER 


According to (B-14), the eigenvalues of P,, must be real. Since, according to 
(B-13), their squares are equal to 1, these eigenvalues are simply + 1 and — 1. The 
eigenvectors of P,, associated with the eigenvalue + | are called symmetric, those 
corresponding to the eigenvalue — 1, antisymmetric: 


Paulvs>= |Ws> => | ýs > symmetric 
Pai |¥,> =- | Ya? == | y4) antisymmetric (B-18) 


Now consider the two operators : 
1 


S =>(1 + Pz) (B-19-a) 
1 
These operators are projectors, since (B-13) implies that: 
S =S (B-20-a) 
A? =A (B-20-b) 
and, in addition, (B-14) enables us to show that: 
sS=sS (B-21-a) 
A =A (B-21-b) 


S and A are projectors onto orthogonal subspaces, since, according to (B-13): 

SA = AS =0 (B-22) 
These subspaces are supplementary, since definitions (B-19) yield: 

S+A=1 (B-23) 


If | y > is an arbitrary ket of the state space £, S | yw > is a symmetric ket and 
A | y >, an antisymmetric ket, as it is easy to see, using (B-13) again, that: 

PasIiv>=S|Yw> 

PA AIV>= -—AlY> (B-24) 


For this reason, S and A are called, respectively, a symmetrizer and an antisymme- 
trizer. 


COMMENT: 
The same symmetric ket is obtained by applying Sto P,, | y > orto | y > 

itself: 

SPA Ivw>=S|Y> (B-25) 

For the antisymmetrizer, we have, similarly: 

AP,,|W>=-AlW> (B-26) 
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d. TRANSFORMATION OF OBSERVABLES BY PERMUTATION 


Consider an observable B(1), initially defined in &(1) and then extended into é. 
It is always possible to construct the { | u; > } basis in &(1) from eigenvectors of B(1) 
(the corresponding eigenvalues will be written 5,). Let us now calculate the action of 
the operator P,,B(1)P!, on an arbitrary basis ket of £: 


PA B(1)P}, |1 :u;;2:u;) = P2,B(I)| 1 :u,:2:u,> 
bPa | ius 2 > (B-27) 
= b,| l:iu; 2:u; >» 


We would obtain the same result by applying the observable B(2) directly to the basis 
ket chosen. Consequently: 


P,,B(1)P}, = B(2) (B-28) 
The same reasoning shows that: 
P,,B(2)P3, = B(1) (B-29) 


In &, there are also observables, such as B(1) + C(2) or B(1)C(2), which 
involve both indices simultaneously. We obviously have: 

Pa [B(1) + C(2)]P2, = B(2) + C(I) (B-30) 
Similarly, using (B-17), we find: 

P,,B(I)C(2)P 3, = P,,BUI)P3,P2,C(2)P3, 

= B(2)C(1) (B-31) 

These results can be generalized to all observables in $ which can be expressed in 
terms of observables of the type of B(1) and C(2), to be denoted by ( (1, 2): 

P>¡€ (1, 2)P3, = ©(2, 1) (B-32) 
O (2, 1)is the observable obtained from ( (1, 2) by exchanging indices | and 2 through- 


out. 
An observable ©,(1, 2) is said to be symmetric if: 


Os(2, 1) = Os(1. 2) (B-33) 


According to (B-32), all symmetric observables satisfy : 


P),¡0s(1, 2) = Cs(1, 2)P,, (B-34) 
that is : 
[Os(1, 2), Py] = 0 (B-35) 


Symmetric observables commute with the permutation operator. 
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2. Systems containing an arbitrary number of particles 


In the state space of a system composed of N particles with the same spin 
(temporarily assumed to be of different natures), N! permutation operators can be 
defined (one of which is the identity operator). If N is greater than 2, the properties 
of these operators are more complex than those of P,,. To have an idea of the changes 
involved when N is greater than 2, we shall briefly study the case in which N = 3. 


a. DEFINITION OF THE PERMUTATION OPERATORS 


Consider, therefore, a system of three particles which are not necessarily 
identical but have the same spin. As in $ B-1-a, we construct a basis of the state space 
of the system by taking a tensor product: 


AE A ao u>} (B-36) 
In this case, there exist six permutation operators, which we shall denote by: 


Pi P312» P331, Pi32 P313» P321 (B-37) 


By definition, the operator P„p (where n, p, q is an arbitrary permutation of the 
numbers 1, 2, 3) is the linear operator whose action on the basis vectors obeys: 


Papa [Lili iu) lA A qu)» (B-38) 
For example: 


Paza | liu 2 iu 3 iu) ES UL SAO 
=|i4 2:46:34, (B-39) 


P,,3 therefore coincides with the identity operator. The action of P,,,, on any ket 
of the state space can easily be obtained by expanding this ket on the basis (B-36). 

The N! permutation operators associated with a system of N particles with 
the same spin could be defined analogously. 


b. PROPERTIES 
%. The set of permutation operators constitutes a group 
This can easily be shown for the operators (B-37): 


(i) P,,, is the identity operator. 


(ii) The product of two permutation operators is also a permutation operator. We can 
show, for example, that: 


PsP 132 = Psat (B-40) 


To do so, we apply the left-hand side to an arbitrary basis ket: 
PsP 132 11004322433 :4,) =P312|1:0,53:4,52:4,> 
= Py). |1iuj;32:4,5324;> 
[sa lod) 
=|1:4,32:4,53 :4,> (B-41) 


I 
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The action of P,,, effectively leads to the same result: 
Py] 1:02:03 :4,>)=]|3:0,2:4,51:4> 
=|1:4,2:4,53 :4,> (B-42) 


(iii) Each permutation operator has an inverse, which is also a permutation operator. 
Reasoning as in (/i), we can easily show that: 


Pro's = Pins Pih = Pasis Pii = Pan 
Pisz = Pis Pas = Pas: P32) = Psat (B-43) 


Note that the permutation operators do not commute with each other. 


For example: 
Pi32P312 = Pass (B-44) 


which, compared to (B-40), shows that the commutator of P,,, and P;,, is not zero. 


p. Transpositions. Parity of a permutation operator 


A transposition is a permutation which simply exchanges the roles of two of 
the particles, without touching the others. Of the operators (B-37), the last three 
are transposition operators*. Transposition operators are Hermitian, and each 
of them is the same as its inverse, so that they are also unitary [the proofs of these 
properties are identical to those for (B-14), (B-13) and (B-17)]. 

Any permutation operator can be broken down into a product of transposition 
operators. For example, the second operator (B-37) can be written: 


P312 = PsP 213 = P321 P132 = Po13P 321 az PosPruslP ir = (B-45) 


This decomposition is not unique. However, for a given permutation, it can be 
shown that the parity of the number of transpositions into which it can be broken 
down is always the same : it is called the parity of the permutation. Thus, the first 
three operators (B-37) are even, and the last three, odd. For any N, there are 
always as many even permutations as odd ones. 


y. Permutation operators are unitary 


Permutation operators, which are products of transposition operators, all 
of which are unitary, are therefore also unitary. However, they are not necessarily 
Hermitian, since transposition operators do not generally commute with each 
other. 

Finally, note that the adjoint of a given permutation operator has the same 
parity as that of the operator, since it is equal to the product of the same transposition 
operators, taken in the opposite order. 


* Of course, for N = 2, the only permutation possible is transposition. 
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c. COMPLETELY SYMMETRIC OR ANTISYMMETRIC KETS. 
SYMMETRIZER AND ANTISYMMETRIZER 


Since the permutation operators do not commute for N > 2, it is not possible 
to construct a basis formed by common eigenvectors of these operators. 
Nevertheless, we shall see that there exist certain kets which are simultaneously 
eigenvectors of all the permutation operators. 

We shall denote by P, an arbitrary permutation operator associated with 
a system of N particles with the same spin; a represents an arbitrary permutation 
of the first N integers. A ket | 45 > such that: 


P,|¥s> = |Ws> (B-46) 


for any permutation P,, is said to be completely symmetric. Similarly, a completely 
antisymmetric ket | y, > satisfies, by definition* : 


P, | Ya) =& |Wa> (B-47) 


where: 


€, = + 1 if P, is an even permutation 


a 
€, = — l if P, is an odd permutation (B-48) 
The set of completely symmetric kets constitutes a vector subspace 6, of the state 
space &; the set of completely antisymmetric kets, a subspace & ,. 

Now consider the two operators: 


1 

Sey Ei (B-49) 
1 

A= Wi y EPa (B-50) 


where the summations are performed over the N! permutations of the first N 
integers, and e, is defined by (B-48). We shall show that S and A are the projectors 
onto &, and &, respectively. For this reason, they are called a symmetrizer and an 
antisymmetrizer. 

S and A are Hermitian operators: 


St=S (B-51) 
A'=A (B-52) 


The adjoint P! of a given permutation operator is, as we saw above (cf. §B-2-b-y), another 
permutation operator, of the same parity (which coincides, furthermore, with P; *). Taking the 
adjoints of the right-hand sides of the definitions of Sand A therefore amounts simply to changing 
the order of the terms in the summations (since the set of the P; ' is again the permutation group). 


* According to the property stated in § B-2-b-B. this definition can also be based solely on the 
transposition operators : any transposition operator leaves a completely symmetric ket invariant and 
transforms a completely antisymmetric ket into its opposite. 
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Also, if P, is an arbitrary permutation operator, we have: 


P,S = SP, =S (B-53-a) 
PLA = AP, =8,A (B-53-b) 


This is due to the fact that P, P, is also a permutation operator: 


Er (B-54) 
such that: 
A (B-55) 


If, for P, fixed, we choose successively for P, all the permutations of the group, we see that the Py 
are each identical to one and only one of these permutations (in, of course, a different order). 
Consequently: 


1 1 

Pas = 312 PaPa DMP” =$ (B-56-a) 
1 1 

P, A = a Y EP o Po = al Eno 2 EgPg = EnA (B-56-b) 


Similarly, we could prove analogous relations in which S and A are multiplied by P,, from the 
right. 


From (B-53), we see that: 


S =S 
A?=A (B-57) 


and, moreover : 
AS = SA =0 (B-58) 


This is because: 
s? a ee S 
N! N! 
A ra T (B-59) 
N a E & N A a 
as each summation includes N! terms; furthermore : 
1 1 
AS = È PS = qi Le =0 (B-60) 


since half the e, are equal to + 1 and half equal to — 1 (cf. $ B-2-b-B). 


S and A are therefore projectors. They project respectively onto $, and 6, 
since, according to (B-53), their action on any ket | y > of the state space yields 
a completely symmetric or completely antisymmetric ket: 


Pa, S| w> 
P,,A|W> 


s|w> (B-61-a) 
e,,4|w> (B-61-b) 
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COMMENTS: 


(i) The completely symmetric ket constructed by the action of S on P,| >, 
where P, is an arbitrary permutation, is the same as that obtained from | y >, 
since expressions (B-53) indicate that: 


SP,|¥>=S|¥> (B-62) 
As for the corresponding completely antisymmetric kets, they differ at most 
by their signs : 

AP,|W> =8,A|w> (B-63) 


(ii) For N > 2, the symmetrizer and antisymmetrizer are not projectors onto 
supplementary subspaces. For example, when N = 3, it is easy to obtain 
[by using the fact that the first three permutations (B-37) are even and the 
others odd] the relation: 

1 

S +A =3(Piz + Posi + P312) #1 (B-64) 
In other words, the state space is not the direct sum of the subspace 6, of 
completely symmetric kets and the subspace $, of completely antisymmetric 
kets. 


d. TRANSFORMATION OF OBSERVABLES BY PERMUTATION 


We have indicated ($ B-2-b-f) that any permutation operator of an N-particle 
system can be broken down into a product of transposition operators analogous 
to the operator P,, studied in $ B-1. For these transposition operators, we can 
use the arguments of $B-1-d to determine the behavior of the various observables 
of the system when they are multiplied from the left by an arbitrary permutation 
operator P, and from the right by P}. 

In particular, the observables @,(1, 2, ..., N), which are completely symmetric 
under exchange of the indices 1, 2, ..., N, commute with all the transposition 
operators, and, therefore, with all the permutation operators: 


[@,(1, 2, ..., N), P,] =0 (B-65) 


C. THE SYMMETRIZATION POSTULATE 


1. Statement of the postulate 


When a system includes several identical particles, only certain kets 
of its state space can describe its physical states. Physical kets are, depending 
on the nature of the identical particles, either completely symmetric or 


completely antisymmetric with respect to permutation of these particles. 
Those particles for which the physical kets are symmetric are called bosons, 
and those for which they are antisymmetric, fermions. 
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The symmetrization postulate thus limits the state space for a system of 
identical particles. This space is no longer, as it was in the case of particles of 
different natures, the tensor product & of the individual state spaces of the particles 
constituting the system. It is only a subspace of £, &, or &,, depending on whether 
the particles are bosons or fermions. 

From the point of view of this postulate, particles existing in nature are 
divided into two categories. All currently known particles obey the following 
empirical rule*: particles of half-integral spin (electrons, positrons, protons, neu- 
trons, muons, etc.) are fermions, and particles of integral spin (photons, 
mesons, etc.) are bosons. 


COMMENT: 


Once this rule has been verified for the particles which are called “elementary”, 
it holds for all other particles as well, inasmuch as they are composed of these elementary 
particles. Consider a system of many identical composite particles. Permuting two of 
them is equivalent to simultaneously permuting all the particles composing the first one 
with the corresponding particles (necessarily identical to the aforementioned ones) of 
the second one. This permutation must leave the ket describing the state of the system 
unchanged if the composite particles being studied are formed only of elementary bosons 
or if each of them contains an even number of fermions (no sign change, or an even 
number of sign changes); in this case, the particles are bosons. On the other hand, compo- 
site particles containing an odd number of fermions are themselves fermions (an odd 
number of sign changes in the permutation). Now, the spin of these composite particles 
is necessarily integral in the first case and half-integral in the second one (chap. X, $ C-3-c). 
They therefore obey the rule just stated. For example, atomic nuclei are known to be 
composed of neutrons and protons, which are fermions (spin 1/2). Consequently, nuclei 
whose mass number A (the total number of nucleons) is even are bosons, and those whose 
mass number is odd are fermions. Thus, the nucleus of the ¿He isotope of helium is 
a fermion, and that of the *He isotope, a boson. 


2. Removal of exchange degeneracy 


We shall begin by examining how this new postulate removes the exchange 
degeneracy and the corresponding difficulties. 

The discussion of $ A can be summarized in the following way. Let | u > be 
a ket which can mathematically describe a well-defined physical state of a system 
containing N identical particles. For any permutation operator P,, P, |u > can 
describe this physical state as well as | u >. The same is true for any ket belonging 
to the subspace é, spanned by |u > and all its permutations P, | u >. Depending 
on the ket |u > chosen, the dimension of £, can vary between 1 and N!. If this 
dimension is greater than l, several mathematical kets correspond to the same 
physical state: there is then an exchange degeneracy. 


* The “spin-statistics theorem”, proven in quantum field theory, makes it possible to consider 
this rule to be a consequence of very general hypotheses. However, these hypotheses may not all be 
correct, and discovery of a boson of half-integral spin or a fermion of integral spin remains possible. 
It is not inconceivable that, for certain particles, the physical kets might have more complex symmetry 
properties than those envisaged here. 
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The new postulate which we have introduced considerably restricts the class 
of mathematical kets able to describe a physical state : these kets must belong 
to &, for bosons and to &, for fermions. We shall be able to say that the difficulties 
related to exchange degeneracy are eliminated if we can show that $, contains 
a single ket of &, or a single ket of & 4. 

To do so, we shall use the relations S = SP, or A = e, AP,, proven in (B-53). 
We obtain: 


S|u> =SP,|u> (C-1-a) 
A|lu> =e8,AP, | u > (C-1-b) 
These relations express the fact that the projections onto &, and &, of the various 
kets which span é, and, consequently, of all the kets of &,, are collinear. The 
symmetrization postulate thus unambiguously indicates (to within a constant 


factor) the ket of &, which must be associated with the physical state considered : 
S | u > for bosons and A | u > for fermions. This ket is called the physical ket. 


COMMENT: 


It is possible for all the kets of £ „to have a zero projection onto & 4 (or és). 
In this case, the symmetrization postulate excludes the corresponding physical 
state. Later (§§3-b and 3-c), we shall see examples of such a situation when 
dealing with fermions. 


3. Construction of physical kets 


a. THE CONSTRUCTION RULE 


The discussion of the preceding section leads directly to the following 
rule for the construction of the unique ket (the physical ket) corresponding to a 
given physical state of a system of N identical particles: 

(i) Number the particles arbitrarily, and construct the ket | > corresponding 
to the physical state considered and to the numbers given to the particles. 

(ii) Apply S or A to | u >, depending on whether the identical particles are 
bosons or fermions. 

(iii) Normalize the ket so obtained. 


We shall describe some simple examples which illustrate this rule. 


b. APPLICATION TO SYSTEMS OF TWO IDENTICAL PARTICLES 


Consider a system composed of two identical particles. Suppose that one of 
them is known to be in the individual state characterized by the normalized 
ket | p >, and the other one, in the individual state characterized by the normalized 
ket | x >. 

First of all, we shall envisage the case in which the two kets, 
are distinct. The preceding rule is applied in the following way: 


g > and | xz >, 
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(i) We label with the number 1, for example, the particle in the state | p >, 
and with the number 2, the one in the state | z >. This gives: 


[u> =|1l:@:2:x)> (C-2) 


(ii) We symmetrize | u > if the particles are bosons: 

S|u>=S[|t:@:2:4> +] 1:7:2:9)] (C-3-a) 
We antisymmetrize | u > if the particles are fermions: 

AJu)=>3[ 1003232) —|1iz:2:0)) (C-3-b) 


(iii) The kets (C-3-a) and (C-3-b), in general, are not normalized. If we 
assume | p > and | Z > to be orthogonal, the normalization constant is very simple 
to calculate. All we have to do to normalize S | u > or A | u > is replace the factor 1/2 
appearing in formulas (C-3) by 1/\/ 2. The normalized physical ket, in this case, can 
therefore be written: 


1 
losx> =—=I[|1:0:2:x>+el1:x:2:0)] (C-4) 
Ja 


with s = + 1 for bosons and — 1 for fermions. 


We shall now assume that the two individual states, | y > and |z >, are 


identical: 

ly>=|x> (C-5) 
(C-2) then becomes: 

Ju>=|1:p:2:p> (C-6) 


| u > is already symmetric. If the two particles are bosons, (C-6) is then the physical 
ket associated with the state in which the two bosons are in the same individual 
state | p >. If, on the other hand, the two particles are fermions, we see that: 


Alu) =4[] 1: 9:2: 9) —|1:9:2: 9] =0 (C-7) 


Consequently, there exists no ket of &, able to describe the physical state in which 
two fermions are in the same individual state | gy >. Such a physical state is therefore 
excluded by the symmetrization postulate. We have thus established, for a special 
case, a fundamental result known as “Pauli's exclusion principle”: two identical 
fermions cannot be in the same individual state. This result has some very important 
physical consequences which we shall discuss in $ D-1. 


c. GENERALIZATION TO AN ARBITRARY NUMBER OF PARTICLES 


These ideas can be generalized to an arbitrary number N of particles. To see 
how this can be done, we shall first treat the case N = 3. 
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Consider a physical state of the system defined by specifying the three 


individual normalized states | y >, | x > and | œ >. The state | u > which enters into 
the rule of $ a can be chosen in the form: 


[uò =|1:0;2:%:3:%>) (C-8) 


We shall discuss the cases of bosons and fermions separately. 


ol. The case of bosons 


The application of S to | u > gives: 
1 
S|u> =i Pel we? 


1 
=a Ll 1:0:2:%;3:0> +|1:9;2:0;3:x>) +]1:x:2:033:0> 
+|1:0;2:0;3:x%> +|1:x32:033:0>+|1:0:2:133:0 >] 
(C-9) 
It then suffices to normalize the ket (C-9). 
First of all, let us assume that the three kets | o >, | y > and | œ > are orthogonal. 
The six kets appearing on the right-hand side of (C-9) are then also orthogonal. 
To normalize (C-9), all we must do is-replace the factor 1/6 by 1/6. 
If the two states | p > and | y > coincide, while remaining orthogonal to | œ >, 
only three distinct kets now appear on the right-hand side of (C-9). It can easily 
be shown that the.normalized physical ket can then be written: 


1 
LaF :0:2:033:0>) 


+|11:032:0;3:0>+|1:0:2:0;:3:09 >] (C-10) 


Finally, if the three states | o>, lx), | y > are the same, the ket: 


luy =| 152% e238 a> (C-11) 
is already symmetric and normalized. 
p. The case of fermions 


The application of A to | u >.gives: 
l 
Alu> =a 282. 11:0532:133:0) (C-12) 


The signs of the various terms of the sum (C-12) are determined by the same rule 
as those of a 3 » 3 determinant. This is why it is convenient to write A | u> 
in the form of a Slater determinant : 
i [L:p> |1:y%> |l:@> 
A|u>=>3,|12:0> ESPA (C-13) 
13:9> |3:x> |3:0> 
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Ali z > or | w > coincide, since 
the determinant (C-13) then has two identical columns. We obtain Pauli's exclusion 
principle. already mentioned in §C-3-b: the same quantum mechanical state cannot 
be simultaneously occupied by several identical fermions. 

Finally, note that if the three states | @ >, 
six kets appearing on the right-hand side of (C-12) are orthogonal. All we must then 
do to normalize A | u > is replace the factor 1/3! appearing in (C-12) or (C-13) 
by I/v 3!. 

If, now, the system being considered contains more than three identical 
particles, the situation actually remains similar to the one just described. It can be 
shown that, for N identical bosons, it is always possible to construct the physical 
state S | u > from arbitrary individual states | y >, , .. On the other hand, for 
fermions, the physical ket A | «> can be written in the form of an N x N Slater 
determinant: this excludes the case in which two individual states coincide (the 
ket A | u > is then zero). This shows, and we shall return to this in detail in $D, how 
different the consequences of the new postulate can be for fermion and boson 
systems. 


d. CONSTRUCTION OF A BASIS IN THE PHYSICAL STATE SPACE 


Consider a system of N identical particles. Starting with a basis, {|u; >}, 


in the state space of a single particle, we can construct the basis: 
EA A ND | 


in the tensor product space &. However, since the physical state space of the system 
is not $. but, rather. one of the subspaces. $ or £4. the problem arises of how to 
determine a basis in this physical state space. 

By application of S (or 4) to the various kets of the basis: 


A A E 


we can obtain a set of vectors spanning 65 (or 6 ,). Let | y > be an arbitrary ket 
of 6 ,. for example (the case in which | p > belongs to 6 , can be treated in the same 
way). | p >, which belongs to £, can be expanded in the form: 


lp > = ae aj...» | 1:42 20 a N 234) (C-14) 


Since | y >, by hypothesis, belongs to £y, we have S| p > = | p >, and we simply 
apply the operator S to both sides of (C-14) to show that | > can dd expressed 
in the form of a linear combination of the various kets S | 1: u;: 2 : EN ZA 

However, it must be noted that the various kets S | loa, 2: Us... Nou, > are 
not independent. Let us permute the roles of the various particles in one of the 
kets | 1 :u,:2 :0,;...: N : u, > of the initial basis (before symmetrization). On this 
new ket, application of S or A leads, according to (B-62) and (B-63), to the same 
ket of é; or 6, (possibly with a change of sign). 

Thus, we are led to introduce the concept of an occupation number: by 
definition, for the ket | I : u,:2 : u: ...: N : u, >, the occupation number n, of the 
individual state | u, > is Sail to the number of times that the state | u, > appears in the 
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sequence { . | u, >? } that is, the number of particles in the state | u, > 


we have, obviously, >, n, = N). Two different kets | l:4,32: UN AA > for 
k 


which the occupation numbers are equal can be obtained from each other by the 
action of a permutation operator. Consequently, after the action of the symmetrizer S 
(or the antisymmetrizer 4), they give the same physical state, which we shall denote 
AA E 


[nis Ags rere 
=ceS|liu; iu A H A Ay + Aiu.) 
sy Fe 
n, particles n, particles 
in the state | v, > in the state | u, > 


For fermions, S would be replaced by 4 in (C-15) (c is a factor which permits the 
normalization of the state obtained in this way*). We shall not study the states 
| N,, Nz, -.., Ny, ... > in detail here; we shall confine ourselves to giving some of their 
important properties: 


(i) The scalar product of two kets | n, n3, -= Aps -= > and | mj, 15, ..., Mj. > 
is different from zero only if all the occupation numbers are equal (n, = n; for 
all k). 


By using (C-15) and definitions (B-49) and (B-50) of S and A, we can obtain the expansion 
i i Hp iuj Niu) 
It is then easy to see that, if the occupation numbers are not all equal, these two kets cannot 
simultaneously have non-zero components on the same basis vector. 


(ii) If the particles under study are bosons, the kets | 74, n3, ..., Aps ... >, in which 
the various occupation numbers n, are arbitrary | with, of course 2m = = N), 


form an orthonormal basis of the physical state space. 


Let us show that, for bosons, the kets |74, 12, ..., My. ... X defined by (C-15) are never 
zero. To do so, we replace S by its definition (B-49). There then appear, on the right-hand side 
of (C-15), various orthogonal kets |1 :u,;2:uj;;...;N:u, >, all with positive coefficients. 
| 21, Mz, ..., My, ... » Cannot, therefore, be zero. 

The | n4, A3, ..., Az, ... > form a basis in 6, since these kets span $5, are all non-zero, and 
are orthogonal to each other. 


(iii) If the particles under study are fermions, a basis of the physical state 
space $, is obtained by choosing the set of kets | n,, n3, ..., My. ... > in which all the 


occupation numbers are equal either to 1 or to 0 (again with Y n, = N). 
k 


* A simple calculation yields : ¢ = = /Ntn, ! ny! ... for bosons and JN! for fermions. 
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The preceding proof is not applicable to fermions because of the minus signs which 
appear before the odd permutations in definition (B-50) of A. Furthermore, we saw in $c that 
two identical fermions cannot occupy the same individual quantum state : if any one of the 
occupation numbers is greater than 1, the vector defined by (C-15) is zero. On the other hand, 
it is never zero if all the occupation numbers are equal to one or Zero, since two particles are then 
never in the same individual quantum state, so that the kets |1 :u,;2:u,;...;N:u, > and 
P,|1:u,;2:u;;...3.N:u, > are always distinct and orthogonal. Relation (C-15) therefore 
defines a non-zero physical ket in this case. The rest of the proof is the same as for bosons. 


4. Application of the other postulates 


It remains for us to show how the general postulates of chapter III can be 
applied in light of the symmetrization postulate introduced in $ C-1, and to verify 
that no contradictions arise. More precisely, we shall see how measurement 
processes can be described with kets belonging only to either £; or &,. and we 
shall show that the time evolution process does not take the ket | (1) > associated 
with the state of the system out of this subspace. Thus, all the quantum mechanical 
formalism can be applied inside either €, or £4. 


a. MEASUREMENT POSTULATES 
a. Probability of finding the system in a given physical state 


Consider a measurement performed on a system of identical particles. The 
ket | y(t) > describing the quantum state of the system before the measurement 
must, according to the symmetrization postulate, belong to $, or to & ,, depending 
on whether the system is formed of bosons or fermions. To apply the postulates of 
chapter III concerning measurements, we must take the scalar product of | y(t) > 
with the ket lu>» corresponding to the physical state of the system after the 
measurement. This ket |u> is to be constructed by applying the rule given 
in §C-3-a. The probability amplitude < u | Y(t) > can therefore be expressed in 
terms of two vectors, both belonging either to &; or to £,. In $D-2, we shall 
discuss a certain number of examples of such calculations. 

If the measurement envisaged is a “complete” measurement (yielding, for 
example, the positions and spin components S, for all the particles), the physical 
ket |u > is unique (to within a constant factor). On the other hand, if the 
measurement is “incomplete” (for example, a measurement of the spins only, or a 
measurement bearing on a single particle), several orthogonal physical kets are 
obtained, and the corresponding probabilities must then be summed. 


B. Physical observables; invariance of 65 and € , 


In certain cases, it is possible to specify the measurement performed on the 
system of identical particles by giving the explicit expression of the corresponding 
observable in terms of R,, P,,S,,R,, P}, S,, etc. 

We shall give some concrete examples of observables which can be measured 
in a three-particle system: 

— Position of the center of mass R¿, total momentum P and total angular 
momentum L: 
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Ro =3(R, +R, +R;) (C-16) 
P =P, +P, +P, (C-17) 
Lek fis, (C-18) 


— Electrostatic repulsion energy: 


2 
Ww Ela a 20 a) (C-19) 
4ncy\|R, — Ral |R} — Ri] |R; — R| 
— Total spin: 
S = S, +S, +S, (C-20) 


etc. 

It is clear from these expressions that the observables associated with the 
physical quantities considered involve the various particles symmetrically. This 
important property follows directly from the fact that the particles are identical. 
In (C-16), for example, R,, R, and R, have the same coefficient, since the three 
particles have the same mass. It is the equality of the charges which is at the basis 
of the symmetrical form of (C-19). In general, since no physical properties are 
modified when the roles of the N identical particles are permuted, these N particles 
must play a symmetrical role* in any actually measurable observable. Mathema- 
tically, the corresponding observable G, which we shall call a physical observable, 
must be invariant under all permutations of the N identical particles. It must 
therefore commute with all the permutation operators P, of the N particles 
(cf. $ B-2-d): 


[G, P,] =0 forall P, (C-21) 
For a system of two identical particles, for example, the observable R, — R, 
(the vector difference of the positions of the two particles), which is not invariant 
under the effect of the permutation P,, (R, — R, changes signs) is not a physical 
observable; indeed. a measurement of R, — R, assumes that particle (1) can be 
distinguished from particle (2). On the other hand, we can measure the distance 


between the two particles, that is, y (R, — R,)’, which is symmetrical. 

Relation (C-21) implies that &, and 6, are both invariant under the action 
of a physical observable G. Let us show that, if | y > belongs to 6,4, G | y > also 
belongs to $, (the same proof also applies, of course, to 6s). The fact that | y > 
belongs to £, means that: 


PL Ivo? =e | v> (C-22) 
Now let us calculate P,G | y >. According to (C-21) and (C-22), we have: 
P,G|W>=GP,|~> = 6 |4> (C-23) 


* Note that this reasoning is valid for fermions as well as for bosons. 
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Since the permutation P, is arbitrary, (C-23) expresses the fact that G | > is 
completely antisymmetric and therefore belongs to &,. 

All operations normally performed on an observable — in particular, the 
determination of eigenvalues and eigenvectors — can therefore be applied to G 
entirely within one of the subspaces, £, or &,. Only the eigenkets of G belonging 
to the physical subspace, and the corresponding eigenvalues, are retained. 


COMMENTS: 


(i) All the eigenvalues of G which exist in the total space & are not necessarily 
found if we restrict ourselves to the subspace $, (or &,). The effect of the 
symmetrization postulate on the spectrum of a symmetric observable G may 
therefore be to abolish certain eigenvalues. On the other hand, it adds 
no new eigenvalues to this spectrum, since, because of the global invariance 
of &, (or £4) under the action of G, any eigenvector of G in $, (or &,) is also 
an eigenvector of G in & with the same eigenvalue. 


(ii) | Consider the problem of writing mathematically, in terms of the observables R,, P;. S,, 
etc., the observables corresponding to the different types of measurement envisaged in $ a. 
This problem is not always simple For example, for a system of three identical particles, 
we shall try to write the observables corresponding to the simultaneous measurement of 
the three positions in terms of R,, R, and R,. We can resolve this problem by considering 
several physical observables chosen such that we can, using the results obtained by 
measuring them, unambiguously deduce the position of each particle (without, of course, 
being able to associate a numbered particle with each position). For example, we can 
choose the set 


NAG X, X, + e + XX, XXX 


(and the corresponding observables for the Y and Z coordinates). However, this point 
of view is rather formal. Rather than trying to write the expressions for the observables 
in all cases, it is simpler to follow the method used in $ a, in which we confined ourselves 
to using the physical eigenkets of the measurement. 


b. TIME-EVOLUTION POSTULATES 


The Hamiltonian of a system of identical particles must be a physical 
observable. We shall write, for example, the Hamiltonian describing the motion 
of the two electrons of the helium atom about the nucleus, assumed to be 
motionless*: 

P? PS 2?  2e e? 
2m, 2m, R, R, |R, -R| 


e 


H(1, 2) = 


(C-24) 


The first two terms represent the kinetic energy of the system ; they are symmetrical 
because the two masses are equal. The next two terms are due to the attraction 
of the nucleus (whose charge is twice that of the proton). The electrons are 
obviously equally affected by this attraction. Finally, the last term describes 


* Here, we shall consider only the most important terms of this Hamiltonian. See comple- 
ment B,,, for a more detailed study of the helium atom. 
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the mutual interaction of the electrons. It is also symmetrical, since neither of 
the two electrons is in a privileged position. It is clear that this argument can be 
generalized to any system of identical particles. Consequently, all the permutation 
operators commute with the Hamiltonian of the system: 


[H, P,] =0 (C-25) 
Under these conditions, if the ket | W(t.) > describing the state of the system 


at a given time £, is a physical ket, the same must be true of the ket | W(t) > obtained 
from | (tg) > by solving the Schrödinger equation. According to this equation: 


| y(t + a> = (1 +o 1) 1w0> (C-26) 
Now, applying P, and using relation (C-25): 

P,| W(t + dt) > = ( +H aj a| W(t) > (C-27) 
If | y(t) > is an eigenvector of P, , | W(t + dt) > is also an eigenvector of P,, with 


the same eigenvalue. Since | y( to) >, by hypothesis, is a completely symmetric or 
completely antisymmetric ket, this property is conserved over time. 

The symmetrization postulate is therefore also compatible with the postulate 
which gives the time evolution of physical systems : the Schrödinger equation does 
not remove the ket | y(r) > from é’s or £4 


D. DISCUSSION 


In this final section, we shall examine the consequences of the symmetri- 
zation postulate on the physical properties of systems of identical particles. First 
of all, we shall indicate the fundamental differences introduced by Pauli’s exclusion 
principle between systems of identical fermions and systems of identical bosons. 
Then, we shall discuss the implications of the symmetrization postulate concerning 
the calculation of the probabilities associated with the various physical processes. 


1. Differences between bosons and fermions. 
Pauli's exclusion principle 


In the statement of the symmetrization postulate, the difference between 
bosons and fermions may appear insignificant. Actually, this simple sign difference 
in the symmetry of the physical ket has extremely important consequences. 
As we saw in $C-3 the symmetrization postulate does not restrict the individual 
states accessible to a system of identical bosons. On the other hand, it requires 
fermions to obey Pauli’s exclusion principle : two identical fermions cannot occupy 
the same quantum mechanical state. 

The exclusion principle was formulated initially in order to explain the 
properties of many-electron atoms ($D-l-a below and complement A,,,,). It can 
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now be seen to be more than a principle applicable only to electrons: it is a 
consequence of the symmetrization postulate, valid for all systems of identical 
fermions. Predictions based on this principle, which are often spectacular, have 
always been confirmed experimentally. We shall give some examples of them. 


a. GROUND STATE OF A SYSTEM OF INDEPENDENT IDENTICAL PARTICLES 


The Hamiltonian of a system of identical particles (bosons or fermions) is 
always symmetrical with respect to permutations of these particles ($ C-4). Consider 
such a system in which the various particles are independent, that is, do not 
interact with each other (at least in a first approximation). The corresponding 
Hamiltonian is then a sum of one-particle operators of the form: 


H(1, 2, .... N) = A(1) + A(2) + ... + A(N) (D-1) 


h(1) is a function only of the observables associated with the particle numbered (1); 
the fact that the particles are identical [which implies a symmetrical Hamiltonian 
H(1, 2, ..., N)] requires this function / to be the same in the N terms of expres- 
sion (D-1). In order to determine the eigenstates and eigenvalues of the total 
Hamiltonian H(l, 2, ..., N), we simply calculate those of the individual 
Hamiltonian h(/) in the state space &(/) of one of the particles: 


AGI On) = Enl Pn: [Pa EEU) (D-2) 


For the sake of simplicity, we shall assume that the spectrum of A(j) is discrete 
and non-degenerate. 

If we are considering a system of identical bosons, the physical eigenvectors 
of the Hamiltonian H(1, 2, ..., N) can be obtained by symmetrizing the tensor 
products of N arbitrary individual states | 9, >: 


| OF sny > Se > Pa | l: Pn, ; 2: Pn e N : Pan > (D-3) 


where the corresponding energy is the sum of the N individual energies: 


E Gy, EE Fu Es (D-4) 


My 2 AN 


[it can easily be shown that each of the kets appearing on the right-hand side 
of (D-3) is an eigenket of H with the eigenvalue (D-4); this is also true of their sum ]. 
In particular, if e, is the smallest eigenvalue of h(j), and |, > is the associated 
eigenstate, the ground state of the system is obtained when the N identical bosons 
are all in the state | p, >. The energy of this ground state is therefore: 


Ena. ¡= Ne, (D-5) 
and its state vector is: 
[OP cna = |l pnp Np (D-6) 


Now, suppose that the N identical particles considered are fermions. It is no 
longer possible for these N particles all to be in the individual state | y, >. To obtain 
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the ground state of the system, Pauli’s exclusion principle must be taken into 
account. If the individual energies e, are arranged in increasing order: 


A E A Se ay (D-7) 
the ground state of the system of N identical fermions has an energy of: 
Evan = €i + e> +o... + En (D-8) 


and it is described by the normalized physical ket: 


as Mia ld 


1 12:0) [2:0 > [2 7@y > 
| sis v= >= y (D-9) 


IN :p,> IN :p,>..|N :0y> 


The highest individual energy ey found in the ground state is called the Fermi 
energy of the system. 

Pauli's exclusion principle thus plays a role of primary importance in all 
domains of physics in which many-electron systems are involved, such as atomic 
and molecular physics (cf. complements A,,, and By) and solid state physics 
(cf. complement C,,,, ), and in all those in which many-proton and many-neutron 
systems are involved, such as nuclear physics*. 


COMMENT: 


In most cases, the individual energies e, are actually degenerate. Each 
of them can then enter into a sum such as (D-8) a number of times equal 
to its degree of degeneracy. 


b. QUANTUM STATISTICS 


The object of statistical mechanics is to study systems composed of a very large 
number of particles (in numerous cases, the mutual interactions between these 
particles are weak enough to be neglected in a first approximation). Since we do not 
know the microscopic state of the system exactly, we content ourselves with 
describing it globally by its macroscopic properties (pressure, temperature, 
density, etc.) A particular macroscopic state corresponds to a whole set of 
microscopic states. We then use probabilities : the statistical weight of a macroscopic 
state is proportional to the number of distinct microscopic states which correspond 
to it, and the system, at thermodynamic equilibrium, is in its most probable 
macroscopic state (with any constraints that may be imposed taken into account). 
To study the macroscopic properties of the system, it is therefore essential to 


* The ket representing the state of a nucleus must be antisymmetric both with respect to the 
set of protons and with respect to the set of neutrons. 
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determine how many different microscopic states possess certain characteristics 
and, in particular. a given energy. 

In classical statistical mechanics (Maxwell-Boltzmann statistics), the N par- 
ticles of the system are treated as if they were of different natures, even if they are 
actually identical. Such a microscopic state is defined by specifying the individual 
state of each of the N particles. Two microscopic states are considered to be 
distinct when these N individual states are the same but the permutation of the 
particles is different. 

In quantum statistical mechanics, the symmetrization postulate must be taken 
into account. A microscopic state of a system of identical particles is characterized 
by the enumeration of the N individual states which form it, the order of these states 
being of no importance since their tensor product must be symmetrized or anti- 
symmetrized. The numbering of the microscopic states therefore does not lead to the 
same result as in classical statistical mechanics. In addition, Pauli’s principle radi- 
cally differentiates systems of identical bosons and systems of identical fermions: the 
number of particles occupying a given individual state cannot exceed one for 
fermions, while it can take on any value for bosons (cf. $ C-3). Different statistical 
properties result: bosons obey Bose-Einstein statistics and fermions, Fermi-Dirac 
Statistics. This is the origin of the terms “bosons” and “fermions”. 

The physical properties of systems of identical fermions and systems of 
identical bosons are very different. These differences can be observed, for example. 
at low temperatures. The particles then tend to accumulate in the lowest-energy 
individual states, as is possible for identical bosons (this phenomenon is called Bose 
condensation), while identical fermions are subject to the restrictions of Pauli's 
principle. Bose condensation is at the origin of the remarkable properties (super- 
fluidity) of the *He isotope of helium, while the *He isotope, which is a fermion 
(cf. comment of $ C-1), does not possess the same properties. 


2. The consequences of particle indistinguishability 
on the calculation of physical predictions 


In quantum mechanics. all the predictions concerning the properties of a system 
are expressed in terms of probability amplitudes (scalar products of two state vectors) 
or matrix elements of an operator. It is then not surprising that the symmetrization 
or antisymmetrization of state vectors causes special interference effects to appear in 
systems of identical particles. First, we shall specify these effects, and then we shall 
see how they disappear under certain conditions (the particles of the system, although 
identical, then behave as if they were of different natures). To simplify the discussion, 
we shall confine ourselves to systems containing only two identical particles. 


a. INTERFERENCES BETWEEN DIRECT AND EXCHANGE PROCESSES 


x. Predictions concerning a measurement on a system of identical particles : 
the direct term and the exchange term 


Consider a system of two identical particles, one of which is known to be in 
the individual state |g > and the other, in the individual state | 7 >. We shall 
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assume | ọ > and | y > to be orthogonal, so that the state of the system is described 
by the normalized physical ket [ cf. formula (C-4)]: 


= 


Josx> =~ EU + sPall 1: 0:2 24> (D-10) 
2 
where: 

E= +1 if the particles are bosons 

£= — |l if the particles are fermions (D-11) 


With the system in this state, suppose that we want to measure on each of the 
two particles the same physical quantity B with which the observables B(1) 
and B(2) are associated. For the sake of simplicity, we shall assume that the 
spectrum of B is entirely discrete and non-degenerate : 


B|u,>=b,|u,> (D-12) 


What is the probability of finding certain given values in this measurement (5, for 
one of the particles and 4,. for the other one)? We shall begin by assuming b, and b, 
to be different, so that the corresponding eigenvectors | u, > and | u,. > are orthogonal. 
Under these conditions, the normalized physical ket defined by the result of this 
measurement can be written: 


l 
Upi Up > => 
| 2 


which gives the probability amplitude associated with this result: 


[1 +EP, ]] 30,230, (D-13) 


1 
< upi Uy | @3%> =3< 1 3432 say (1 + eP} (1 +6P,,)|1:9;32:x%> 


(D-14) 
Using properties (B-13) and (B-14) of the operator P,,, we can write: 
5 +EP! (1 +eP,,)=1 +8P,, (D-15) 
(D-14) then becomes: 
cut leido = Cl 2 a (l Py) di 2 (D-16) 
Letting 1 + eP,, act on the bra, we obtain: 
< Upi Uy O A ME 
tet lines: 2iu,|12@i2:x> 
== bp eo | 2 
+8(1:4,|1:9><2:04,|2:x> 
=< u| P) Lun |X) + eC ty |>|) (D-17) 
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The numbering has disappeared from the probability amplitude, which is now 
expressed directly in terms of the scalar products < u, | @ >... < u, | x >. Also, the 
probability amplitude appears either as a sum (for bosons) or a difference (for 
fermions) of two terms, with which we can associate the diagrams of figures 4-a and 
4-b. 


<u, | <———_——> | o> ¢ u, | le> 

< u,, | <————+ | 1 € uy, | ED 
a b 

FIGURE 4 


Schematic representation of the direct term and the exchange term associated with a measurement 
performed on a system of two identical particles. Before the measurement, one of the particles is 
known to be in the state le> and the other one, in the state lx >. The measurement result 
obtained corresponds to a situation in which one particle is in the state | u, > and the other one, in the 
state |u,, >. Two probability amplitudes are associated with such a measurement; they are 
represented schematically by figures a and b. These amplitudes interfere with a + sign for 
bosons and with a — sign for fermions. 


We can interpret result (D-17) in the following way. The two kets | p > 
and | x > associated with the initial state can be connected to the two bras < u, | and 
< 4, | associated with the final state by two different “paths”, represented schema- 
tically by figures 4-a and 4-b. With each of these paths is associated a probability 
amplitude, <u, |o > <u, | X> or <u, |> <u, |x>, and these two amplitudes 
interfere with a + sign for bosons and a — sign for fermions. Thus, we obtain the 
answer to the question posed in $ A-3-a above: the desired probability 2(b,: b,.) 
is equal to the square of the modulus of (D-17): 


Plb, by) = Kuro > lup) + eK my 1 E> <u | xl (D-18) 


One of the two terms on the right-hand side of (D-17), the one which corresponds, 
for example, to path 4-a, is often called the direct term. The other term is called the 
exchange term. 


COMMENT: 


Let us examine what happens if the two particles, instead of being iden- 
tical, are of different natures. We shall then choose as the initial state of the 
system the tensor product ket: 


ly> = |1:@;2:z%> (D-19) 


Now, consider a measurement instrument which, although the two particles, 
(1) and (2), are not identical, is not able to distinguish between them. If it 
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yields the results b, and b,., we do not know if b, is associated with particle (1) 
or particle (2) (for example, for a system composed of a muon u” and an 
electron e”, the measurement device may be sensitive only to the charge of the 
particles, giving no information about their masses). The two eigenstates 
| 1 :u,32:u,.> and |1:u,.;2:u, > (which, in this case, represent different 
physical states) then correspond to the same measurement result. Since they 
are orthogonal, we must add the corresponding probabilities, which gives: 


Pb,ib,)=K1:4,32:4,|1:032: 101 
+ K1l:4,32:4.|1:0932: xl? 
= K u, |o >|? K up |x >l T K uy | o >|? E un |x >l? (D-20) 


Comparison of (D-18) with (D-20) clearly reveals the significant difference 
in the physical predictions of quantum mechanics depending on whether the 
particles under consideration are identical or not. 


Now consider the case in which the two states | u, > and | u,. > are the same. 


When the two particles are fermions, the corresponding physical state is excluded 
by Pauli's principle, and the probability 2(b,: b,) is zero. On the other hand, if the 
two particles are bosons, we have: 


|i, = |1: up; 2i) (D-21) 


and, consequently : 


1 
Telpa CIA + Pa )|1:0:2:7> 


=/2<u,|9><u,|7> (D-22) 


which gives: 


(ii) 
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Piba; by) = 2 |u, | 9 > < Ua |x >l? (D-23) 


COMMENTS: 


Let us compare this result with the one which would be obtained in the case, 
already considered above, in which the two particles are different. We must 
then replate | ø: y > by | 1: @;2:7 >and | up: tp > by | 1 : up: 2 : u, >. which 
gives the value for the probability amplitude : 


CUIDAD (D-24) 
and, consequently : 


P'(b, db.) = Ku lp><a |x? (D-25) 


For a system containing N identical particles, there are, in general, N! distinct exchange 
terms which add (or subtract) in the probability amplitude. For example. consider a 
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system of three identical particles in the individual states | @ >. | y > and | œ >. and the 
probability of finding, in a measurement, the results b,, b,. and b,,.. The possible “paths” 
are then shown in figure 5. There are six such paths (all different if the three eigenvalues, 


4.110 > a yee ee 
Gi [2> <ul [z> 
< up |e lo en Gs < Uy» Njo) 


+ + + 


< m| elo) <ul le> “Sc” 
“ST < uy | 2> <ul ES: 
< uy | jas QA | > < up A | > 


E E E 


FIGURE 5 


Schematic representation of the six probability amplitudes associated with a system of three 
identical particles. Before the measurement, one particle is known to be in the state | p >, another, 
in the state | x >, and the last one, in the state | w >. The result obtained corresponds to a situation 
in which one particle is in the state | u, >, another, in the state | u,, >, and the last one, in the 
state |u,.)>. The six amplitudes interfere with a sign which is shown beneath each one 
le = + 1 for bosons, — | for fermions). 


ba b, and 6,,, are different). Some always contribute to the probability amplitude with 
a + sign, others with an e sign (+ for bosons and — for fermions). 


B. Example : elastic collision of two identical particles 


To understand the physical meaning of the exchange term, let us examine 
a concrete example (already alluded to in §A-3-a): that of the elastic collision of 
two identical particles in their center of mass frame*. Unlike the situation in §x above. 
here we must take into account the evolution of the system between the initial time 
when it is in the state | y; > and the time ¢ when the measurement is performed. 
However, as we shall see, this evolution does not change the problem radically, 
and the exchange term enters the problem as before. 


* We shall give a simplified treatment of this problem, intended only to illustrate the relation 


between the direct term and the exchange term. In particular, we ignore the spin of the two particles. 
However, the calculations of this section remain valid in the case in which the interactions are not 


spin-dependent and the two particies are initially in the same spin state. 
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In the initial state of the system (fig. 6-a), the two particles are moving towards 
each other with opposite momenta. We choose the Oz axis along the direction of 
these momenta, and we denote their modulus by p. One of the particles thus possesses 
the momentum pe,, and the other one, the momentum — pe, (where e, is the unit 
vector of the Oz axis). We shall write the physical ket | y; > representing this initial 
state in the form: 


1 
pr + ¢P,)| 1: pe,32: — pe, > (D-26) 


Z 


| y; > describes the state of the system at 1, before the collision. 


n 
ra 
rd 
ra 
pc o P ==> nn E AEE 
(0) Zz “O z 
z 
vá 
a b 
Initial state x Final state 
XK 
FIGURE 6 


Collision between two identical particles in the center of mass frame : the momenta of the two 
particles in the initial state (fig. a) and in the final state found in the measurement (fig. b) are 
represented. For the sake of simplicity, we ignore the spin of the particles. 


The Schródinger equation which governs the time evolution of the system is 
linear. Consequently, there exists a linear operator U(t, t), which is a function of 
the Hamiltonian H, such that the state vector at time f is given by: 


| W(t) > = Ult, to) | Wi> (D-27) 


(complement F,,,). In particular, after the collision, the state of the system at time £, 
is represented by the physical ket: 


|Y) > = Ulti to) |W; > (D-28) 


Note that, since the Hamiltonian H is symmetric, the evolution operator U commutes 
with the permutation operator: 


[U(t, t’), Pa] = 0 (D-29) 


Now, let us calculate the probability amplitude of the result envisaged in § A-3-a, 
in which the particles are detected in the two opposite directions of the On axis, 
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of unit vector n (fig. 6-b). We denote the physical ket associated with this final state 
by: 


[vp > =a (0 + oPay)|1 spms2: — pn) (D-30) 
V 


The desired probability amplitude can therefore be written: 
< Wr | Y(t) > = < Wp | U(t,, to) | Y, > 


1 
=5<! :pn;2: — pal (1 + ¢P$,)U(t,, tol +eP,¡)| 1: pe,; 2: — pe,» 
(D-31) 


According to relation (D-29) and the properties of the operator P, ,, we finally obtain: 


«Wy | Ulti to) | yi> 
= <1 :pm;2: — pn| (1 + eP1,)U(t,, to) | 1: pe,: 2: — pe, > 
= <1:pn;2: — pn| U(t,,to)| 1: pe,; 2: — pe, > 
tel: — pm, 2: pn] U(t,, to) | 1: pe,;2:—pe,> (D-32) 


The direct term corresponds, for example, to the process shown in figure 7-a, and the 
exchange term is then represented by figure 7-b. Again, the probability amplitudes 
associated with these two processes must be added or subtracted. This causes an 


FIGURE 7 


Collision between two identical particles in the center of mass frame : schematic representation 
of the physical processes corresponding to the direct term and the exchange term. The scattering 
amplitudes associated with these two processes interfere with a + sign for bosons and a — sign 
for fermions. 


interference term to appear when the square of the modulus of expression (D-32) is 
taken. Note also that this expression is simply multiplied by £ if n is changed to — n, 
so that the corresponding probability is invariant under this change. 
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b. SITUATIONS IN WHICH THE SYMMETRIZATION POSTULATE CAN BE IGNORED 


If application of the symmetrization postulate were always indispensable, it 
would be impossible to study the properties of a system containing a restricted 
number of particles, because it would be necessary to take into account all the 
particles in the universe which are identical to those in the system. We shall see 
in this section that this is not the case. In fact, under certain special conditions, 
identical particles behave as if they were actually different, and it is not necessary 
to take the symmetrization postulate into account in order to obtain correct physical 
predictions. It seems natural to expect, considering the results of §D-2-a, 
that such a situation would arise whenever the exchange terms introduced by the 
symmetrization postulate are zero. We shall give two examples. 


%. Identical particles situated in two distinct regions of space 


Consider two identical particles, one of which is in the individuai state | ọ > 
and the other, in the state | y >. To simplify the notation, we shall ignore their spin. 
Suppose that the domain of the wave functions representing the kets | y > and | 7 > 
are well separated in space: 


eee 
u(r) = <r] x> 


0 if réD 


O if rgd (Dea 


where the domains D and A do not overlap. The situation is analogous to the classical 
mechanical one (§A-2): as long as the domains D and 4 do not overlap, each of the 
particles can be “followed”; we therefore expect application of the symmetrization 
postulate to be unnecessary. 

In this case, we can envisage measuring an observable related to one 
of the two particles. All we need is a measurement device placed so that it cannot 
record what happens in the domain D, or in the domain 4. If it is D which is excluded 
in this way, the measurement will only concern the particle in 4, an vice versa. 

Now, imagine a measurement concerning the two particles simultaneously, 
but performed with two distinct measurement devices, one of which is not sensitive 
to phenomena occurring in 4, and the other, to those in D. How can the probability 
of obtaining a given result be calculated? Let | u > and | v > be the individual states 
associated respectively with the results of the two measurement devices. Since the 
two particles are identical, the symmetrization postulate must, in theory, be taken 
into account. In the probability amplitude associated with the measurement result, 
the direct term is then < u | p > <v | x >, and the exchange term is <u | z> <v|@)>. 
Now, the spatial disposition of the measurement devices implies that: 


ur)=<rlu>=0 if red 
vr) =<rjv>=0 if reD (D-34) 


According to (D-33) and (D-34), the wave functions u(r) and y(r) do not overlap; 
neither do v(r) and g(r), so that: 


<ulx>=<vlg>=0 (D-35) 
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The exchange term is therefore zero. Consequently, it is unnecessary, in this situation, 
to use the symmetrization postulate. We obtain the desired result directly by rea- 
soning as if the particles were of different natures, labeling, for example, the one in 
the domain D with the number |, and the one situated in 4 with the number 2. 
Before the measurement, the state of the system is then described by the ket 
|1:@:2:%), and with the measurement result envisaged is associated the ket 
| 1 : uw: 2: v >. Their scalar product gives the probability amplitude < u |o ò <v |z >. 

This argument shows that the existence of identical particles does not prevent 
the separate study of restricted systems, composed of a small number of particles. 


COMMENT: 


In the initial state chosen, the two particles are situated in two distinct regions 
of space. In addition, we have defined the state of the system by specifying two individual 
states. We might wonder if, after the system has evolved, it is still possible to study one 
of the two particles and ignore the other one. For this to be the case, it is necessary, not 
only that the two particles remain in two distinct regions of space, but also that they 
do not interact. Whether the particles are identical or not, an interaction always introduces 
correlations between them, and it is no longer possible to describe each of them by a state 
vector. 


B. Particles which can be identified by the direction of their spins 


Consider an elastic collision between two identical spin 1/2 particles (electrons, 
for example), assuming that spin-dependent interactions can be neglected, so that 
the spin states of the two particles are conserved during the collision. If these spin 
states are initially orthogonal, they enable us to distinguish between the two particles 
at all times, as if they were not identical ; consequently, the symmetrization postulate 
should again have no effect here. 


$ 
Y 
Y 

T Fa Z 

o o o A fi € PETE AENA E < AA Z 
(0) L » “0 
a a 4 b 
FIGURE 8 


Collision between two identical spin 1/2 particles in the center of mass frame : a schematic represen- 
tation of the momenta and spins of the two particles in the initial state (fig. a) and in the final 
state found in the measurement (fig. b). If the interactions between the two particles are spin- 
independent, the orientation of the spins does not change during the collision. When the two 
particles are not in the same spin state before the collision (the case of the figure), it is possible 
to determine the “path” followed by the system in arriving at a given final state. For example, 
the only scattering process which leads to the final state of figure b and which has a non-zero 
amplitude is of the type shown in figure 7-a. 
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We can show this, using the calculation of §D-2-a-B. The initial physical ket 
will be, for example (fig. 8-a): 


= Pal | pea 2 = pe > (D-36) 


1 
. = ——/(]| 
|w;> a 


(where the symbol + or — added after each momentum indicates the sign of the 
spin component along a particular axis). The final state which we are considering 
(fig. 8-b) will be described by: 


lyr) =e (1 Pa) | 1 :pn,+:2:— pn,- > (D-37) 


V2 


Under these conditions, only the first term of (D-32) is different from zero, since the 
second one can be written: 


<1: — pn, —;2:pn, + |U(t,, tp) | 1: pe, +:2: — pe, — > (D-38) 


This is the matrix element of a spin-independent operator (by hypothesis) between 
two kets whose spin states are orthogonal; it is therefore zero. Consequently, we 
would obtain the same result if we treated the two particles directly as if they were 
different, that is, if we did not antisymmetrize the initial and final kets and if we 
associated index 1 with the spin state | + > and index 2 with the spin state | — >. 
Of course, this is no longer possible if the evolution operator U, that is, the 
Hamiltonian A of the system, is spin-dependent. 


References and suggestions for further reading: 


The importance of interference between direct and exchange terms is stressed 
in Feynman III (1.2), §3.4 and chap. 4. 

Quantum statistics : Reif (8.4). Kittel (8.2). 

Permutation groups: Messiah (1.17), app. D, §1V: Wigner (2.23), chap. 13; 
Bacry (10.31), §§ 41 and 42. 

The effect of the symmetrization postulate on molecular spectra: Herzberg (12.4), 
Vol. I, chap. III, $ 2f. 

An article giving a popularized version: Gamow (1.27). 
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Axiy: MANY-ELECTRON ATOMS. 
ELECTRONIC CONFIGURATIONS 


Bxw: ENERGY LEVELS OF THE HELIUM ATOM : 


CONFIGURATIONS, TERMS, MULTIPLETS 


Cyiy: PHYSICAL PROPERTIES 
OF AN ELECTRON GAS. 
APPLICATION TO SOLIDS 


Axıy: simple study of many-electron atoms in the 
central-field approximation. Discusses the con- 
sequences of Pauli’s exclusion principle and 
introduces the concept of a configuration. 
Remains qualitative. 


By: study, in the case of the helium atom, of 
the effect of the electrostatic repulsion between 
electrons and of the magnetic interactions. Intro- 
duces the concepts of terms and multiplets. 
Can be reserved for later study. 


Cy jy: study of the ground state of a gas of free 
electrons enclosed in a “box”. Introduces the 
concept of a Fermi energy and periodic boundary 
conditions, Generalization to electrons in solids 
and qualitative discussion of the relation between 
electrical conductivity and the position of the 
Fermi level. Moderately difficult. The physical 
discussions are emphasized. Can be considered 
to be a sequel of Fy. 


Du: EXERCISES 
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Complement Axiy 


MANY-ELECTRON ATOMS. ELECTRONIC CONFIGURATIONS 


1. The central-field approximation 


a. Difficulties related to electron interactions 
b. Principle of the method 
c. Energy levels of the atom 


2. Electronic configurations of various elements 


The energy levels of the hydrogen atom were studied in detail in chapter VII. Such 
a study is considerably simplified by the fact that the hydrogen atom possesses 
a single electron, so that Pauli's principle is not relevant. In addition, by using the 
center of mass frame, we can reduce the problem to the calculation of the energy 
levels of a single particle (the relative particle) subject to a central potential. 

In this complement, we shall consider many-electron atoms. for which 
these simplifications cannot be made. In the center-of-mass frame, we must solve 
a problem involving several non-independent particles. We shall see that this is 
a complex problem and give only an approximate solution, using the central-field 
approximation (which will be outlined, without going into details of the 
calculations). In addition, Pauli’s principle, as we shall show, plays an important 
role. 


1. The central-field approximation 


Consider a Z-electron atom. Since the mass of its nucleus is much larger (several 
thousand times) than that of the electrons, the center-of-mass of the atom practically 
coincides with the nucleus, which we shall therefore assume to be motionless at the 
coordinate origin*. The Hamiltonian describing the motion of the electrons, 
neglecting relativistic corrections and, in particular, spin-dependent terms, can be 
written : 


Z P? Z ze 
H = — — — + l 
2, 2m, i=1 R; 2. RR] R- =] l ) 
We have numbered the electrons arbitrarily from 1 to Z, and we have set: 
2 
e= (2) 


Ane 


* Making this approximation amounts to neglecting the nuclear finite mass effect. 
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where q is the electron charge. The first term of the Hamiltonian (1) represents the 
total kinetic energy of the system of Z electrons. The second one arises from the 
attraction exerted on each of them by the nucleus, which bears a positive charge 
equal to — Zq. The last one describes the mutual repulsion of the electrons [note 
that the summation is carried out here over the Z(Z — 1)/2 different ways of pairing 
the Z-electrons ]. 

The Hamiltonian (1) is too complicated for us to solve its eigenvalue equation 
exactly, even in the simplest case, that of helium (Z = 2). 


a. DIFFICULTIES RELATED TO ELECTRON INTERACTIONS 


2 
In the absence of the mutual interaction term Y — 
i<j IR; — R,| 


would be independent. It would then be easy to determine the energies of the atom. 
We would simply sum the energies of the Z électrons placed individually in the 
Coulomb potential — Ze?/r, and the theory presented in chapter VII would yield 
the result immediately. As for the eigenstates of the atom, they could be obtained 
by antisymmetrizing the tensor product of the stationary states of the various 
electrons. 

It is then the presence of the mutual interaction term that makes it difficult 
to solve the problem exactly. We might try to treat this term by perturbation theory. 
However, a rough evaluation of its relative magnitude shows that this would not 
yield a good approximation. We expect the distance |R, — R| between two electrons 
to be, on the average, roughly the distance R, of an electron from the nucleus. The 
ratio p of the third term of formula (1) to the second one is therefore approximately 
equal to: 


in H, the electrons 


az — 1) 
2 
panan n (3) 

p varies between 1/4 for Z = 2 and 1/2 for Z much larger than 1. Consequently, 
the perturbation treatment of the mutual interaction term would yield, at most, 
more or less satisfactory results for helium (Z = 2), but it is out of the question to 
apply it to other atoms (p is already equal to 1/3 for Z = 3). A more elaborate 
approximation method must therefore be found. 


b. PRINCIPLE OF THE METHOD 


To understand the concept of a central field, we shall use a semi-classical 
argument. Consider a particular electron (i). In a first approximation, the existence 
of the Z — 1 other electrons affects it only because their charge distribution 
partially compensates the electrostatic attraction of the nucleus. In this approxi- 
mation, the electron (/) can be considered to move in a potential which depends only 
on its position r; and takes into account the average effect of the repulsion of the 
other electrons. We choose a potential V.(r;) which depends only on the modulus 
of r; and call it the “central potential” of the atom under consideration. Of course, 
this can only be an approximation : since the motion of the electron (i) actually 
influences that of the (Z — 1} other electrons, it is not possible to ignore the corre- 
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lations which exist between them. Moreover, when the electron (i) is in the immediate 
vicinity of another electron (/), the repulsion exerted by the latter becomes prepon- 
derant, and the corresponding force is not central. However, the idea of an average 
potential appears more valid in quantum mechanics, where we consider the 
delocalization of the electrons as distributing their charges throughout an extended 
region of space. 

These considerations thus lead us to write the Hamiltonian (1) in the form: 


H = > E + HAR) | + W (4) 
with 
WA A n 


If the central potential K(r;) is suitably chosen, W should play the role of 
a small correction in the Hamiltonian H. The central-field approximation then 
consists of neglecting this correction, that is of choosing the approximate Hamil- 
tonian: 


Ho = X [Fe + VARD | 6) 


i=t e 


W will then be treated like a perturbation of H, (cf. complement Bxyy, $2). The 
diagonalization of H, leads to a problem of independent particles : to obtain the 
eigenstates of H,, we simply determine those of the one-electron Hamiltonian: 


p? 
2m 


) (7) 


Definitions (4) and (5) do not, of course, fix the central potential V(r), since 
we always have H = H, + W, for all V(r). However, in order to treat W like 
a perturbation, V.(r) must be wisely chosen. We shall not take up the problem of the 
existence and determination of such an optimal potential here. This is a complex 
problem. The potential V(r) to which a given electron is subject depends on the 
spatial distribution of the (Z — 1) other electrons, and this distribution, in turn, 
depends on the potential V(r), since the wave functions of the (Z — 1) electrons must 
also be calculated from V(r). We must therefore arrive at a coherent solution (one 
generally says “self-consistent”), for which the wave functions determined from V(r) 
give a charge distribution which reconstitutes this same potential V(r). 


c. ENERGY LEVELS OF THE ATOM 


While the exact determination of the potential V(r) requires rather long 
calculations, the short- and long-distance behavior of this potential is simple to 
predict. We expect, for small r, the electron (i) under consideration to be inside the 
charge distribution created by the other electrons, so that it “sees” only the attractive 
potential of the nucleus. On the other hand, for large r, that is, outside the “cloud ” 
formed by the (Z — 1) electrons treated globally, it is as if we had a single point 
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charge situated at the coordinate origin and equal to the sum of the charges of the 
nucleus and the “cloud” [the (Z — 1) electrons screen the field of the nucleus]. 
Consequently (fig. 1): 


e? 
Vir) = — e for large r 
(8) 


2 
Vir) = = a for small r 


For intermediate values of r, the variation of V(r) can be more or less complicated, 
depending on the atom under consideration. 


14n) 


FIGURE 1 


Variation of the central potential V(r) 
with respect to r. The dashed-line curves 
represent the behavior of this potential at 
short distances (— Ze?/r) and at long dis- 
tances (— e?/r). 


Although these considerations are qualitative, they give an idea of the spectrum 
of the one-electron Hamiltonian (7). Since V(r) is not simply proportional to 1/r, 
the accidental degeneracy found for the hydrogen atom (chap. VII, §C-4-b) is no 
longer observed. The eigenvalues of the Hamiltonian (7) depend on the two quantum 
numbers » and / [however, they remain independent of m, since V(r) is central]. 
/, of course, characterizes the eigenvalue of the operator L?, and » is, by definition 
(as for the hydrogen atom), the sum of the azimuthal quantum number /, and the 
radial quantum number & introduced in solving the radial equation corresponding 
to /;n and / are therefore integral and satisfy: 


0O</<n- 1 (9) 
Obviously, for a given value of /, the energies E, increase with n: 


E,,>E,, if n>n (10) 
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For fixed n, the energy is lower when the corresponding eigenstate is more “ pene- 
trating”, that is, when the probability density of the electron in the vicinity of the 
nucleus is larger [according to (8), the screening effect is then smaller]. The 
energies E, , associated with the same value of n can therefore be arranged in order 
of increasing angular momenta: 


Br S Erg © a By (11) 


n 


It so happens that the hierarchy of states is approximately the same for all atoms, 
although the absolute values of the corresponding energies obviously vary with Z. 
Figure 2 indicates this hierarchy, as well as the 2(2/ + 1)-fold degeneracy of each 
state (the factor 2 comes from the electron spin). The various states are represented 
in spectroscopic notation (cf. chap. VII, $C-4-b). Those shown inside the same 
brackets are very close to each other, and may even, in certain atoms, practically 
coincide (we stress the fact that figure 2 is simply a schematic representation intended 
to situate the eigenvalues E, , with respect to each other; no attempt is made to 
establish an even moderately realistic energy scale). 

Note the great difference between the energy spectrum shown and that of the 
hydrogen atom (cf. chap. VII, fig. 4). As we have already pointed out, the energy 
depends here on the orbital quantum number /, and, in addition, the order of the 
states is different. For example, figure 2 indicates that the 4s shell has a slightly 
lower energy than that of the 3d shell. This is explained, as we have seen, by the fact 
that the 4s wave function is more penetrating. Analogous inversions occur for the 
n =4 and n = 5 shells, etc. This demonstrates the importance of inter-electron 
repulsion. 


2. Electron configurations of various elements 


In the central-field approximation, the eigenstates of the total Hamiltonian H, 
of the atom are Slater determinants, constructed from the individual electron states 
associated with the energy states E, , which we have just described. This is therefore 
the situation envisaged in $ D-1-a of chapter XIV: the ground state of the atom is 
obtained when the Z electrons occupy the lowest individual states compatible with 
Pauli’s principle. The maximum number of electrons which can have a given 
energy E, is equal to the 2(2/ + 1)-fold degeneracy of this energy level. The set of 
individual states associated with the same energy E, , is called a shell. The list of 
occupied shells with the number of electrons found in each is called the electronic 
configuration. The notation used will be specified below in a certain number of 
examples. The concept of a configuration also plays an important role in the chemical 
properties of atoms. Knowledge of the wave functions of the various electrons and 
of the corresponding energies makes it possible to interpret the number, stability, 
and geometry of the chemical bonds which can be formed by this atom (cf. comple- 
ment E,,,;)- 

To determine the electronic configuration of a given atom in its ground state, 
we simply “fill” the various shells successively, in the order indicated in figure 2 
(starting, of course, with the Is level) until the Z electrons are exhausted. This is 
what we shall do, in a rapid review of Mendeleev’s table. 

In the ground state of the hydrogen atom, the single electron of this atom 
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(o) 


6p 
y a 0 
{ ay 0 — } 
E 
5p 
(6) 
{ e Ss 
(10) (2) 
etc... 
4p 
(6) 
{ 3d fe | Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu, Zn 
10 —_——— 
( ) (2) K,Ca 
3p i 
(6) Al, Si, P, S, CLA 
ar Na, Mg 
a B, C, N, O, F, Ne 
a Li, Be 
5 H, He l 
n=l n=2 n=3 n=4 n=5 n=6 n=7 
FIGURE 2 


Schematic representation of the hierarchy of energy levels (electronic shells) in a central potential 
of the type shown in figure 1. For each value of n, the energy increases with /. The degeneracy 
of each level is indicated in parentheses. The levels which appear inside the same bracket are 
very close to each other, and their relative disposition can vary from one atom to another. 

On the right-hand side of the figure, we have indicated the chemical symbols of the atoms for 
which the electronic shell appearing on the same line is the outermost shell occupied in the ground 
state configuration. 
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occupies the Is level. The electronic configuration of the next element (helium, 
Z = 72) is: 


He: Is? (12) 


which means that the two electrons occupy the two orthogonal states of the 1s shell 
(same spatial wave function, orthogonal spin states). Then comes lithium (Z = 3), 
whose electronic configuration is: 


Li: 1s°, Te (13) 


The ls shell can accept only two electrons, so the third one must go into the level 
which is directly above it, that is, according to figure 2, into the 2s shell. This shell 
can accept a second electron, which gives beryllium (Z = 4) the electronic configu- 
ration: 


Be: 15?, 2s? (14) 


For Z > 4, the 2p shell (cf: fig. 2) is the first to be gradually filled, and so on. As the 
number Z of electrons increases. higher and higher electronic shells are brought in 
(on the right-hand side of figure 2, we have shown, opposite each of the lowest 
shells, the symbols of the atoms for which this shell is the outermost). Thus, we obtain 
the cenfigurations of the ground state for all the atoms. This explains Mendeleev's 
classification. However, it must be noted that levels which are very close to each 
other (those grouped in brackets in figure 2) may be filled in a very irregular fashion. 
For example, although figure 2 gives the 4s shell a lower energy than that of the 
3d shell, chromium (Z = 24) has five 3d electrons although the 4s shell is incomplete. 
Similar irregularities arise for copper (Z = 29), niobium (Z = 41), etc. 


COMMENTS: 


(i) The electronic configurations which we have analyzed characterize the ground 
state of various atoms in the central-field approximation. The lowest excited 
states of the Hamiltonian H, are obtained when one of the electrons moves 
to an individual energy level which is higher than the last shell occupied in the 
ground state. We shall see, for example, in complement B,,y, that the first 
excited configuration of the helium atom is: 


ls, 2s (15) 


(ii) A single non-zero Slater determinant is associated with an electronic configu- 
ration ending with a complete shell, since there are then as many orthogonal 
individual states as there are electrons. Thus, the ground state of the rare gases 
(... ns?, np®) is non-degenerate, as is that of the alkaline-earths (..., ns”). On the 
other hand, when the number of external electrons is smaller than the degree 
of degeneracy of the outermost shell, the ground state of the atom is degenerate. 
For the alkalines (..., ns), the degree of degeneracy is equal to 2; for carbon 
(1s, 257, 2p?), it is equal to CZ = 15, since two individual states can be chosen 
arbitrarily from the six orthogonal states constituting the 2p shell. 
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It can be shown that, for a complete shell, the total angular momentum is zero, 
as are the total orbital angular momentum and the total spin (the sums, 
respectively, of the orbital angular momenta and the spins of the electrons 
occupying this shell). Consequently, the angular momentum of an atom* 
is due only to its outer electrons. Thus, the total angular momentum of 
a helium atom in its ground state is zero, and that of an alkali metal is equal 
to 1/2 (a single external electron of zero orbital angular momentum and 


spin 1/2). 


References and suggestions for further reading: 


Pauling and Wilson (1.9), chap. IX; Levine (12.3), chap. 11, $51. 2 and 3; 
Kuhn (11.1). chap. IV, §§ A and B: Schiff (1.18), $47; Slater (1.6). chap. 6: Landau and 
Lifshitz (1.19), $5 68. 69 and 70. See also references of chap. XI (Hartree and Hartree-Fock 


methods). 
The shell model in nuclear physics : Valentin (16.1), chap. VI: Preston (16.4). 


chap. 7: Deshalit and Feshbach (16.6), chap. IV and V. See also articles by Mayer (16.20). 
Peierls (16.21) and Baranger (16.22). 


* The angular momentum being discussed here is that of the electronic cloud of the atom. The 


nucleus also possesses an angular momentum which should be added to this one. 


1417 


www.elsolucionario.net 


COMPLEMENT Byy 


Complement Byy 


ENERGY LEVELS OF THE HELIUM ATOM: 
CONFIGURATIONS, TERMS, MULTIPLETS 


1. The central-field approximation. Configurations 


a. The electrostatic Hamiltonian 
b. The ground state configuration and first excited configurations 
c. Degeneracy of the configurations 
2. The effect of the inter-electron electrostatic repulsion: exchange energy, spectral terms 
a. Choice of a basis of € (n, 1; n', l) adapted to the symmetries of W 
a. Total orbital momentum L and total spin S 
B. Constraints imposed by the symmetrization postulate 
b. Spectral terms. Spectroscopic notation 
c. Discussion 
a. Energies of the spectral terms arising from the 1s, 2s configuration 
B. The exchange integral 
y. The physical origin of the energy difference of the two spectral terms 
6. Analysis of the role played by the symmetrization postulate 
&. The effective spin-dependent Hamiltonian 


3, Fine-structure levels; multiplets 


In the preceding complement, we studied many-electron atoms in the central-field 
approximation in which the electrons are independent. This enabled us to introduce 
the concept of a configuration. We shall evaluate the corrections which must be made 
to this approximation, taking into account the inter-electron electrostatic repulsion 
more precisely. In order to simplify the reasoning, we shall confine ourselves to the 
simplest many-electron atom, the helium atom. We shall show that, under the effect 
of the inter-electron electrostatic repulsion, the configurations of this atom (§1) 
split into spectral terms (§2), which give rise to fine-structure multiplets (§3) when 
smaller terms in the atomic Hamiltonian (magnetic interactions) are taken into 
account. The concepts we shall bring out in this treatment can be generalized to 
more complex atoms. 


1. The central-field approximation. Configurations 
a. THE ELECTROSTATIC HAMILTONIAN 


As in the preceding complement, we shall take into account only the electro- 
static forces at first, writing the Hamiltonian of the helium atom [formula (C-24) of 
chapter XIV ] in the form: 


H=H,+W (1) 
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where: 
P? P? 
Ho = TA + Im. + V{R,) + V(R3) (2) 
and: 
2e? 2e? e? 
W = — — VÁR,) — V{R2) (3) 


=— — =— SS oF 
R; R3 IR, z R,| 


The central potential V(r) is chosen so as to make W a small correction of Ho. 

When W is neglected, the electrons can be considered to be independent 
(although their average electrostatic repulsion is partially taken into account by the 
potential V.). The energy levels of H, then define the electronic configurations 
which we shall study in this section. We shall then examine the effect of W by using 
Stationary perturbation theory in §2. 


b. THE GROUND STATE CONFIGURATION AND FIRST EXCITED CONFIGURATIONS 


According to the discussion of complement Axy ($ 2), the configurations of 
the helium atom are specified by the quantum numbers », / and n’, l of the two 
electrons (placed in the central potential V.). The corresponding energy E, can be 
written : 


E, = Es. + Epe (4) 


Thus (fig. 1), the ground state configuration, written 1s*, is obtained when the two 
electrons are in the 1s shell; the first excited configuration Is, 2s, when one electron 
is in the 1s shell and the other one is in the 2s shell. Similarly, the second excited 
configuration is the Is, 2p configuration. 


Is,2p 
Is,25 
FIGURE 1 
> The ground state configuration and first excited configurations 
Is of the helium atom (the energies are not shown to scale). 


The excited configurations of the helium atom are of the form 1s, n'l’. Actually, there 
also exist “doubly excited” configurations of the type nl, n'l (with n, n’ > 1). But, for helium, 
their energy is greater than the ionization energy £, of the atom (the limit of the energy of the 
configuration 1s, n'l when n’ —> œ). Most of the corresponding states, therefore, are very 
unstable : they tend to dissociate rapidly into an ion and an electron and are called “auto- 
ionizing states”. However, there exist levels belonging to doubly excited configurations which 
are not autoionizing, but which decay by emitting photons. Some of the corresponding spectral 
lines have been observed experimentally. 
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c. DEGENERACY OF THE CONFIGURATIONS 


Since V is central and not spin-dependent, the energy of a configuration does 
not depend on the magnetic quantum numbersmandm'(-—-/<m<l!l-—Il<m'<!Í) 
or on the spin quantum numbers £ and ¢’ (e = +, ¢ = +) associated with the two 
electrons. Most of the configurations, therefore, are degenerate; it is this degeneracy 
which we shall now calculate. 

A state belonging to a configuration is defined by specifying the four quantum 
numbers (n, /, m, e) and (n’, I’, m', e') of each electron. Since the electrons are 
identical particles, the symmetrization postulate must be taken into account. The 
physical ket associated with this state can, according to the results of §C-3-b of 
chapter XIV, be written in the form: 


|n, l,m, e;n, l, m,e > E — P )| 1:n, l,m, 8;2:n', l,m, E > (5) 
2 


Pauli’s principle excludes the states of the system for which the two electrons would 
be in the same individual quantum state (n = n', l = l’, m = m', £ = eg”). According 
to the discussion of § C-3-b of chapter XIV, the set of physical kets (5) for which 
n, l, n', I are fixed and which are not null (that is, not excluded by Pauli’s principle) 
constitute an orthonormal basis in the subspace &(n, I; n’, /') of €, associated with 
the configuration nl, n't. 

To evaluate the degeneracy of a configuration n/, n'l, we shall distinguish 
between two cases: 


(i) The two electrons are not in the same shell (we do not have n = n' and 
i=). 

The individual states of the two electrons can never coincide, and m, m’, e, £ 
can independently take on any value. The degeneracy of the configuration, conse- 
quently, is equal to: 


2(21 + 1) xX 2(Q1 + 1) = 4(2/ + DQr +1) (6) 


The Is, 2s and ls, 2p configurations enter into this category; their degeneracies are 
equal to 4 and 12 respectively. 


(ii) The two electrons are in the same shell (n = n' and / = /'). 

In this case, the states for which m = m' and £ = £' must be excluded. Since 
the number of distinct individual quantum states is equal to 2(2/ + 1), the degree of 
degeneracy of the n’? configuration is equal to the number of pairs that can be formed 
from these individual states (cf. $ C-3-b of chapter XIV), that is: 


Cherri = (22 + Ml + 1) (7) 

Thus, the 15? configuration, which enters into this category, is not degenerate. 
It is useful to expand the Slater determinant corresponding to this configuration. 
If, in (5), we set n = n' =1,/=l=m=m' =0, £= +, € = —, we obtain, 
writing the spatial part as a common factor: 
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|is?> =]1 1,0,052:1,0,0) ® (| 1:4+;2:->)-|1:-;2:4)) (8) 
2 

In the spin part of (8), we recognize the expression for the singlet state 

|S = 0, Ms = 0)», where S and Mg are the quantum numbers related to the 

total spin S = S, + S, (cf. chap. X, §B-4). Thus, although the Hamiltonian H, 

does not depend on the spins, the constraints introduced by the symmetrization 

postulate require the total spin of the ground state to have the value S = 0. 


2. The effect of the inter-electron electrostatic repulsion: 
exchange energy, spectral terms 


We shall now study the effect of W by using stationary perturbation theory. 
To do so, we must diagonalize the restriction of W inside the subspace £ (n, /;n'. 1) 
associated with the n/, n'l’ configuration. The eigenvalues of the corresponding 
matrix give the corrections of the configuration energy E, to first order in W: 
the associated eigenstates are the zero-order eigenstates. 

To calculate the matrix which represents W inside &(n, /: n’, ['), we can 
choose any basis, in particular, the basis of kets (5). Actually, it is to our advantage 
to use a basis which is well adapted to the symmetries of W. We shall see that 
we can choose a basis in which the restriction of W is already diagonal. 


a. CHOICE OF A BASIS OF 6(n,/;7n',/') ADAPTED TO THE SYMMETRIES OF W 


o. Total orbital momentum L and total spin S 


W does not commute with the individual orbital angular momenta L, and L, 
of each electron. However, we have already shown (cf. chap. X. §A-2) that, 
if L denotes the total orbital angular momentum: 


L=L, +L, (9) 
we have: 
ya 
[W, L] = E L = 0 (10) 
Ry2 


Therefore, L is a constant of the motion*. Moreover, since W does not act in the 
spin state space, this is also true for the total spin S: 


[w,S] =0 (11) 


Now, consider the set of four operators, L?, S?, L., S.. They commute with 
each other and with W. We shall show that they constitute a C.S.C.O. in the 
subspace $ (n, l: n’, 1") of &,. This will enable us in $ b to find directly the eigenvalues 
of the restriction of W in this subspace. 


* This result is related to the fact that, under a rotation involving both electrons, the distance 
between them, R,,, is invariant. However, it changes if only one of the two electrons is rotated. This 
is why W commutes with neither L, nor L,. 
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To do this, we shall return to the space $, the tensor product of the state 
spaces &(1) and &(2) relative to the two electrons, numbered arbitrarily. The 
subspace £ (n, 1; n', l') of &, associated with the nl, n'l’ configuration can be obtained 
by antisymmetrizing the various kets of the subspace &, (1) © £p ,.(2) of &*. If we 
choose the basis | 1 : n, 4, m, € > Q |2 : n’, /', m', e” > in this subspace, we obtain 
the basis of physical kets (5) by antisymmetrization. 

However, we know from the results of chapter X that we can also choose 
in &, ,(1) O Enr (2) another basis composed of common eigenvectors of L?, L,, 
S?, S, and entirely defined by the specification of the corresponding eigenvalues. 
We shall write this basis: 


(|l:n 32:11, l; L, M> @|S.Ms>} (12) 


with: 


L=1 +1 1+P=1),., =P 
(13) 
S=1,0 


Since L?, L,, S*, S, are all symmetric operators (they commute with P,,), the 
vectors (12) remain, after antisymmetrization, eigenvectors of L?, £,, S?, S, with 
the same eigenvalues (some of them may, of course, have a zero projection onto & 4. 
in which case the corresponding physical states are excluded by Pauli’s principle: 
see § B below). The non-zero kets obtained by antisymmetrization of (12) are 
therefore orthogonal, since they correspond to different eigenvalues of at least one 
of the four observables under consideration. Since they span &(n, /; n', l’), they 
constitute an orthonormal basis of this subspace, which we shall write: 


lala, l: L, Mi; S, Ms >} (14) 
with: 


|n, d; n'l; L, My; S, Ms > 
= e(l — Pr) f| 1:7, 132: l; L, Mp > @|S,Ms>} (15) 

where c is a normalization constant. L?, L,, S?, S, therefore form a C.S.C.O. 
inside &(n, L; n’, I’). 

Now, we shall introduce the permutation operator P$? in the spin state 
space: 

P| deg 2ee a lee (16) 
We showed in $B-4 of chapter X [cf. comment (¿i)] that: 

PS) | S, Ms > = (- 1987! |S. Ms > (17) 


* We could also start with the subspace 6,.,(1) 9 €, (2) [ef. comment (i) of §B-2-c of 
chapter XIV, p. 1386]. 
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Furthermore, if P$§) is the permutation operator in the state space of the orbital 


variables, we have: 
Psy = PS O PE (18) 
Using (17) and (18), we can, finally, put (15) in the form: 


[adn > L.M, S, Ms > 
=e (Ll — (- DP] liad; 2:n l; L, M,>0 |S, Ms > (19) 


B. Constraints imposed by the svmmetrization postulate 


We have seen that the dimension of &(n, /; n’, l’) is not always equal to 
4(2/ + 1)(2/’ + 1), that is, to the dimension of 6, (1) © @,, (2). Certain kets of 
6, ,(1) @ En (2) can therefore have a zero projection onto &(n, /: n’, I’). It is 
interesting to study the consequences for the basis (14) of this constraint imposed 
by the symmeirization postulate. 

First of all, assume that the two electrons do not occupy the same shell. 
It is then easy to see that the orbital part of (19) is a sum or a difference of two 
orthogonal kets and. consequently, is never zero*. Since the same is true of | S, Ms >, 
we see that all the possible values of L and S [cf. formula (13)] are allowed. For 
example, for the Is, 2s configuration, we can have S = 0, L = OandS =1,L =0; 
for the Is, 2p configuration. we can have S = 0, L = |l and S = 1. L = H etc. 

If we now assume that the two electrons occupy the same shell, we have 
n=." and /= l’, and certain of the kets (19) can be zero. Let us write 
Lin, 132 :.1, 1: L, M, > in the form: 


lona Li 2 3H, PL, Mz > 


= YY Chima’ 


L.M¿>l1:n,Lm;2:.1,1,m > (20) 


According to relation (25) of complement B,: 

Cidimom' | LM, > = (<li; n m| L,M, > (21) 
By using (20), we then obtain: 

PO (Tin dilon EM, > = (= DP lin hi2tn LM, > (22) 
Substituting this result into (19), we obtain**: 


0 if L + Sisod 
In lin tL. M,:S, M5) = eee (23) 
|1:n,1:2:n,1,L,M,>0|S. Ms> 


if L + Sis even 


Therefore, L and S cannot be arbitrary : L + S must be even. In particular, for 
the 1s? configuration, we must have L = 0, so S = 1 is excluded. This is a result 
found previously. 


het . l 
* The normalization constant e is then equal to —=- 
> 


** The normalization constant is then 1.2. 
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Finally, note that the symmetrization postulate introduces a close correlation 
between the symmetry of the orbital part and that of the spin part of the physical 
ket (19). Since the total ket must be antisymmetric, and the spin part, depending 
on the value of S, is symmetric (S = 1) or antisymmetric (S = 0), the orbital part 
must be antisymmetric when S = | and symmetric when S = 0. We shall see the 
importance of this point later. 


b. SPECTRAL TERMS. SPECTROSCOPIC NOTATION 


W commutes with the four observables L?, L,, S?, S,, which form a C.S.C.O. 
inside &(n, /; n', 1’). It follows that the restriction of W inside &(n, /: n’, I’) is 
diagonal in the basis: 


Lia, lor, UL, My: S, Ms > y 
and has eigenvalues of: 
(L,S) = <n, l; n’, l’; L, M,3S, Ms | W|n,lonm, USL, M:S, Mg») (24) 


This energy depends neither on M, nor on Mg, since relations (10) and (11) imply 
that W commutes not only with L, and S, but also with L, and S,: W is therefore 
a scalar operator in both the orbital state space and the spin state space 
(cf. complement By,. $85-b and 6-c). 

Inside each nl, n'l configuration, we thus obtain energy levels 
E.(n, l; n', l) + 6(L, S), labeled by their values of L and S. Each of them is 
(2L + 1)(25 + 1)-fold degenerate. Such levels are called spectral terms and 
denoted in the following way. With each value of L is associated. in spectroscopic 
notation (chap. VII, §C-4-b) a letter of the alphabet; we write the corresponding 
capital letter and add, at the upper left, a number equal to 2S + 1. For example, 
the 15? configuration leads to a single spectral term, written 'S (the °S, as we have 
seen, is forbidden by Pauli's principle). The 1s, 2s configuration produces two terms. 
1S (non-degenerate) and *S (three-fold degenerate) : the Is, 2p configuration. two 
terms, *P (degeneracy 3) and °P (degeneracy 9). For a more complicated configura- 
tion such as, for example, 2p?, we obtain (cf. $ 2-a-B) the spectral terms 'S, 'D 
and °P (L + S must be even), etc. 

Under the effect of the electrostatic repulsion, the degeneracy of each 
configuration is therefore partially removed (the Is? configuration, which is non- 
degenerate, is simply shifted). We shall study this effect in greater detail in the 
simple example of the Is, 2s configuration. We shall try to understand why the two 
terms 'S and 3S resulting from this configuration. and whose total spin values are 
different, have different energies although the original Hamiltonian is purely 
electrostatic. 


c. DISCUSSION 

a. Energies of the spectral terms arising from the \s, 2s configuration 
In the ls, 2s configuration, = /’ = L = 0. lt is then easy to obtain from (20): 
|l:n=1l,/=0;2:n=2, l =0; L=M, =0>)= (25) 
ll:n=1,/=m=0;2:n'=2,l'=m'=0) 
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a vector which we shall write, more simply, | 1 : 15,2 : 2s >. If | °S, M¿>and|'S,0> 
denote the states corresponding to the two spectral terms °S and 'S arising from 
the Is, 2s configuration, we obtain, substituting (25) into (19): 


V2 


|'S,0> =z + PY)|1:1s;2:2s>] @|S =0,M,=0) (26-b) 
2 


35, My > =—=[(1 = PC) | 1: 1s;2:2s>] @|S = 1, Ms) (26-a) 


Since W does not act on the spin variables, the eigenvalues given by (24) can be 
written: 


(1 :ds:2:2s] (1 — PY) W (1 — PO)]1:1s:2:2s)> (27-a) 
(1:1s5:2:2s](1 + PP) W (1 +PS)|1:1532:25> (27-b) 


(we have used the fact that P§9’ is Hermitian). Moreover, PS°’ commutes with W, 
and the square of P/%? is the identity operator. Therefore: 


(1 + PE) W (L + PS) = (1 + PE) W = 21 + PY) W (28) 


Finally, we obtain: 


OS) =K -J (29-a) 
ee E (29-b) 
with: 
K=(«(1:15:2:2s|W|1:15;2:25> (30) 
J=(1:15:2:25 | P0W|1:15,2:25) 
=<1:2s;2:ls|W|1:1s;2:2s> (31) 


K therefore represents an overall shift of the energy of the two terms and does not 
contribute to their separation. J is more interesting, as it introduces an energy 
difference between the 3S and !S terms (cf. fig. 2). We shall therefore study it in a 
little more detail. 


/ FIGURE 2 
2J =08eV The relative position of the spec- 


Mi tral terms 'S and °S arising from 
the is, 2s configuration of the 


LS K helium atom. K represents an ove- 

E rall shift of the configuration. The 

VA removal of the degeneracy is pro- 
ats 2s t portional to the exchange integral J 
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Pp. The exchange integral 


When we substitute expression (3) for W into (31), there appear terms of the 
form: 


€1:25;2; Is | K(R,)[1:15;2:25> 
=< lid] V1R,) | eis >< 22 181252532) 


2: 1s >and |2 : 2s > is zero. 
Expression (32) is then equal to zero. The same type of reasoning shows that the 
terms which arise from the operators V.(R,), — 2e7/R,, — 2e?/R, are also zero, 
since each of these operators acts only in the single-electron spaces while the state 
of the two electrons is different in the ket and bra of (31). Finally, there remains: 


2 


J =(1:25:2:18| pt 
1 2 


Asi 2 S255 (33) 
J therefore involves only the electrostatic repulsion between the electrons. 

Let Pntm(0) be the wave functions associated with the states |n, l,m) 
(the stationary states of an electron in the central potential V,): 


On.1.m(1) = < r | A, l, m > (34) 


In the {|r > } representation, the calculation of J from (33) yields: 
J = je Pr E ry Poor) P$ oot) mn r, r| P1.0,0(F 1) P2,0.0(82) (35) 


This integral is called the “exchange integral”. We shall not calculate it explicitly 
here; we point out, however, that it is positive. 


Y. The physical origin of the energy difference between the two spectral terms 


We see from expressions (26) and (27) that the origin of the energy separation 
of the ¿S and 'S terms lies in the symmetry differences of the orbital parts of these 
terms. As we emphasized at the end of $ 2-a, a triplet term (S = 1) must have an 
orbital part which is antisymmetric under exchange of the two electrons ; hence the 
— sign before P/% in (26-a) and (27-a). On the other hand. a singlet term (S = 0) 
must have a symmetric orbital part [ + sign in (26-b) and (27-b)]. 

This explains the relative position of the °S and 'S terms shown in figure 2. 
For the singlet term, the orbital wave function is symmetric with respect to 
exchange of the two electrons, which then have a non-zero probability of being 
at the same point in space. This is why the electrostatic repulsion, which gives 
an energy of e?/r,, which is large when the electrons are near each other, 
significantly increases the singlet state energy. On the other hand, for the triplet 
state, the orbital function is antisymmetric with respect to exchange of the two 
electrons, which then have a zero probability of being at the same point in space. 
The mean value of the electrostatic repulsion is then smaller. Therefore. the energy 
difference between the singlet and triplet states arises from the fact that the 
correlations between the orbital variables of the two electrons depend, because of 
the symmetrization postulate, on the value of the total spin. 
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ò. Analysis of the role played by the symmetrization postulate 


At this point in the discussion, it might be thought that the degeneracy of a 
configuration is removed by the symmetrization postulate. We shall show* that 
this is not the case. This postulate merely fixes the value of the total spin of the 
terms arising from a given configuration (because of the inter-electron electrostatic 
repulsion). 

To see this, imagine for a moment that we do not need to apply the symmetri- 
zation postulate. Suppose. for example. that the two electrons are replaced by two 
particles (fictitious, of course) of the same mass. the same charge and the same 
spin as the electrons but with another intrinsic property which permits us to 
distinguish between them [without. however. changing the Hamiltonian H of the 
problem. which is still given by formula (1)]. Since H is not spin-dependent and 
we do not have to apply the symmetrization postulate, we can ignore the spins 
completely until the end of the calculations, and then multiply the degeneracies 
obtained by 4. The energy level of Hp corresponding to the ls, 2s configuration ts 
two-fold degenerate from the orbital point of view because two orthogonal states. 
| |: dy: 2: 2y > and | 1: 2s: 2: 1s > correspond to it (they are different physical 
states since the two particles are of different natures). To study the effect of W, 
we must diagonalize W in the two-dimensional space spanned by these two kets. 
The corresponding matrix can be written : 


E J 00 
J K 


where J and K are given by (30) and (31) [the two diagonal elements of (36) are 
equal because MW is invariant under permutation of the two particles]. Matrix (36) 
can be diagonalized immediately. The eigenvalues found are K + J and K — J, 
associated respectively with the symmetric and antisymmetric linear combinations 
of the two kets | i : Is: 2 : 2s > and | I : 2s; 2 : 1s >. The fact that these orbital 
eigenstates have well-defined symmetries relative to exchange of the two particles has 
nothing to do with Paulis principle. It arises only from the fact that W commutes 
with PL (common eigenstates of W and P3"' can therefore be found). 

When the two particles are not identical. we obtain the same arrangement of 
levels and the same orbital symmetry as belore. On the other hand. the degeneracy of 
the levels is obviously different: the lower level. with energy K - /. can have a total 
spin of either S = 0 or S = I, as can the upper level. 

If we return to the real helium atom. we now see very clearly the role played 
by Pauli’s principle. It is not responsible for the splitting of the initial level Is, 2s into 
the two energy levels K + J and A — J, since this splitting would also appear for 
two particles of different natures. Similarly. the symmetric or antisymmetric 
character of the orbital part of the eigenvectors Is related to the invariance of the 
electrostatic interaction under permuation of the two electrons. Paulis principle 
merely forbids the lower state to have a total spin S = 9 and the upper state to have 
a total spin S = l, since the corresponding states would be globally symmetric. 
which is unacceptable for fermions. 


* See also comment (i) of $ C-4-a-B of chapter XIV. p. 1395. 
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£. The effective spin-dependent Hamiltonian 
We replace W by the operator: 
W=2+8S,.8, (37) 
where S, and S, denote the two electron spins. We also have: 


E 2 
E 


B a 
+5 38 
413° (38) 
so that the eigenstates of W are the triplet states, with the eigenvalue x + BR?/4, 
and the singlet state, with the eigenvalue « — 38h?/4. Therefore, if we set : 


(39) 
p => a 


we obtain, by diagonalizing W. the same eigenstates and eigenvalues we found 
above*. We can then consider that it is as if the perturbation responsible for the 
appearance of the terms were W (the “effective” Hamiltonian), which is of the same 
form as the magnetic interaction between two spins. However, one should not 
conclude that the coupling energy between the electrons, which is responsible for 
the appearance of the two terms, is of magnetic origin : two magnetic moments 
equal to that of the electron and placed at a distance of the order of 1 Á from each 
other would have an interaction energy much smaller than J. However, because 
of the very simple form of w, this effective Hamiltonian is often used instead of W. 

An analogous situation arises in the study of ferromagnetic materials. In these 
substances, the electron spins tend to align themselves parallel to each other. 
Since the spin state is then completely symmetric, Pauli’s principle requires the 
orbital state to be completely antisymmetric. For the same reasons as for the 
helium atom, the electronic repulsion energy is then minimal. When we study 
such phenomena, we often use effective Hamiltonians of the same type as (37). 
However, it must be noted that the physical interaction which is at the origin 
of the coupling is again electrostatic and not magnetic. 


COMMENTS: 


(1) The ls, 2p configuration can be treated in the same way. We then have 
L = l, so that M; = + 1,0 or — 1. As for the Is, 2s configuration, the 
shells occupied by the two electrons are different, so that the two terms *P 
and 'P exist simultaneously. The first one is nine-fold degenerate, and the 
second, three-fold. It can be shown, as above, that the °P term has an energy 
lower than that of the *P term, and the difference between the two energies 
is proportional to an exchange integral which is analogous to the one written 


* We must, obviously, keep only the eigenvectors of W which belong to Éa 
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in (35). We would proceed in the same way for all other configurations 
of the type Is, n't. 


(i) We have treated W like a perturbation of Ho. For this approach to be coherent, the energy 
shifts associated with W [for example, the exchange integral written in (35)] must be 
much smaller than the energy differences between configurations. Now, this is not the 
case; for the 15,25 and 1s,2p configurations, for example, while the energy difference 
AE('S — 3S) in the 15,2s configuration is of the order of 0.8 eV, the minimum distance 
between levels is 4E[(1s.2pPP — (1s.2s)'S] = 0.35 eV. We might therefore believe 
that it is not valid to treat W like a perturbation of Ho. 

However, the approach we have given is correct. This is due to the fact that, for 
all configurations of the type Is,n‘/’, we have L = l. Therefore, W, which, according 
to (10), commutes with L, has zero matrix elements between the states of the 1s,2s confi- 
guration and those of the l.s, 2p configuration, which correspond to different values of L. 
W couples a lsn?’ configuration only to configurations with distinctly higher energies, 
of the 1s.1”/" type with /” = 7 (only the values of n are different) or of the ni,n’/’ type, 
with n and n” different from | (the angular momenta / and /” can be added to give /’). 


3. Fine-structure levels ; multiplets 


Thus far, we have taken into account in the Hamiltonian only interactions 
of purely electrostatic origin: we have neglected all effects of relativistic and 
magnetic origin. Actually, such effects exist, and we have already studied them in 
the case of the hydrogen atom (cf. chap. XII, $ B-1), where they arise from the 
variation of the electron mass with the velocity, from the L . S spin-orbit coupling, 
and from the Darwin term. For helium, the situation is more complicated because 
of the simultaneous presence of two electrons. For example, there is a spin-spin 
magnetic coupling term in the Hamiltonian (cf. complement B,,) which acts in both 
the spin state space and the orbital state space of the two electrons*. Nevertheless, 
a great simplification arises from the fact that the energy differences associated 
with these couplings of relativistic and magnetic origin are much weaker than those 
which exist between two different spectral terms. This enables us to treat the 
corresponding Hamiltonian (the fine-structure Hamiltonian) like a perturbation. 

The detailed study of the fine structure levels of helium falls outside the domain 
of this complement. We shall confine ourselves here to describing the symmetries 
of the problem and indicating how to distinguish between the different energy levels. 
We shall use the fact that the fine-structure Hamiltonian Hs is invariant under a 
simultaneous rotation of all the orbital and spin variables. This means (cf. comple- 
ment By,,, $ 6) that, if J denotes the total angular momentum of the electrons: 


J=L+S (40) 
we have 
[Hsr] =0 (41) 


* See for example $ 19.6 in Sobel'man (11.12) for an explicit expression of the different terms of 
the fine structure Hamiltonian (Breit Hamiltonian). 
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On the other hand. the fine-structure Hamiltonian changes if the rotation acts 
only on the orbital variables or only on the spins : 


[Hsr L] = — [Hsr S] # 0 (42) 


These properties can easily be seen for the operators Y ¢(r,)L;. S,. for example. 


or for the dipole-dipole magnetic interaction Hamiltonian (cf. complement B,,). 
The state space associated with a term is spanned by the states 
|a din l L M: S, Mg > written in (19), where L and S are fixed, and where: 


LS+L 


boR a 
sS œ a 


In this subspace, it can be shown that J? and J. form a C.S.C.O. which, according 
to (41), commutes with H,,. The eigenvectors | J. M, > common to J? [eigenvalue 
J(J + 1)h7] and J. (eigenvalue M,h) are therefore necessarily eigenvectors of Hyp, 
with an eigenvalue which depends on J but not on M, (this last property arises 
from the fact that H,, commutes with J, and J_). According to the general theory 
of addition of angular momenta, the possible values of J are: 


J=L+S.L4+S—-1,L4+S5-—3,....|L—S| (44) 


The effect of H,,. is therefore a partial removal of the degeneracy. For each 
“term”, there appear as many distinct levels as there are different values of J, 
according to relation (44). Each of these levels is (24 + 1)- fold degenerate and is called 
a “multiplet”. The usual spectroscopic notation consists of denoting a multiplet by 
adding a right lower index equal to the value of / to the symbol representing the term 
from which it arises. For example. the ground state of the helium atom gives a single 
multiplet, 'S,. Similarly, each of the terms 'S and °S of the 1s, 2s configuration leads 
to a single multiplet: 'S, and *S,. respectively. On the other hand, the *P term 
arising from ls, 2p yields three multiplets, *P,. °P, and *P, (cf. fig. 3). and so on. 
We point out that the measurement and theoretical calculation of the fine structure 
of the °P level of the Is, 2p configuration is of great fundamental interest, since it 
can lead to the very precise knowledge of the ~ fine structure constant”. g = e7 ‘hc. 


COMMENTS: 
(7) For many atoms, the fine-structure Hamiltonian is essentially given by: 
N 


Hyp = Y AR)L,.S; (45) 


iol 


where R,;, L; and S; denote the positions, angular momenta and spins of each of the 
N electrons. It can then be shown, using the Wigner-Eckart theorem (cf. complement Dy). 
that the energy of the J multiplet is proportional to JJ + 1) — LIL + 1) — S(S + 1). 
This result is sometimes called the “Landé interval rule”. 

For helium. the °P, and *P, levels arising from the Ly.2p~ configuration are much 
closer than would be predicted by this rule. This arises from the importance of the dipole- 
dipole magnetic coupling of the spins of the two electrons. 


1430 


www.elsolucionario.net 


ELECTRON GAS. APPLICATION TO SOLIDS g 


A 


/ 0.25 eV 
ls 2p | 
e aniaden! 
ip, 
esas 
3p y sp, 1 1.210°*eV 
ee See 
XA 110 F eV 
1p; 
FIGURE 3 


The relative position of the spectral terms and multiplets arising from the 1s,2p configuration of 
the helium atom (the splitting of the three multiplets +P,, *P,, °P, has been greatly exaggerated 
in order to make the figure clearer). 


(ii) In this complement, we have neglected the “hyperfine effects” related to 
nuclear spin (cf. chap. XII. $B-2). Such effects actually exist only for the *He 
isotope, whose nucleus has a spin / = 1/2 (the nucleus of the *He isotope 
has a zero spin). Each multiplet of electronic angular momentum J splits. in the 
case of *He, into two hyperfine levels of total angular momentum F = J + 1/2, 
(2F + 1)-fold degenerate (unless, of course, J = 0). 


References and suggestions for further reading: 


Kuhn (11.1), chap. HI-B; Slater (11.8). chap. 18; Bethe and Salpeter (11.10). 

Multiplet theory and the Pauli principle: Landau and Lifshitz (1.19). §§ 64 and 65; 
Slater (1.6), chap. 7 and (11.8), chap. 13; Kuhn (11.1), chap V. $A: Sobel’man (11.12), 
chap. 2. $5.3. 
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Complement Cxıv 


PHYSICAL PROPERTIES OF AN ELECTRON GAS. 
APPLICATION TO SOLIDS 


I. Free electrons enclosed in a “box” 
a. Ground state of an electron gas; Fermi energy Ep 
b. Importance of the electrons with energies close to Ey 
c. Periodic boundary conditions 


2. Electrons in solids 


a. Allowed bands 
b. Position of the Fermi level and electrical conductivity 


In complements Ax and By. we studied, taking the svmmetrization postulate 
into account, the energy levels of a small number of independent electrons placed 
in a central potential (the shell model of many-electron atoms). Now, we shall 
consider systems composed of a much larger number of electrons, and we shall 
show that Pauli’s exclusion principle has an equally spectacular effect on their 
behavior. 

To simplify the discussion, we shall neglect interactions between electrons. 
Moreover, we shall assume, at first (§ 1), that they are subject to no external potential 
other than the one that restricts them to a given volume and which exists only in the 
immediate vicinity of the boundary (a free electrons gas enclosed in a “ box”). 
We shall introduce the important concept of the Fermi energy Ep, which depends 
only on the number of electrons per unit volume. We shall also show that the 
physical properties of the electron gas (specific heat, magnetic susceptibility, ...) are 
essentially determined by the electrons whose energy is close to Ep. 

A free-electron model describes the principal properties of certain metals rather 
well. However the electrons of a solid are actually subject to the periodic 
potential created by the ions of the crystal. We know that the energy levels of each 
electron are then grouped into allowed energy bands, separated by forbidden bands 
(cf. complements Fy; and O,,,). We shall show qualitatively in $ 2 that the electric 
conductivity of a solid is essentially determined by the position of the Fermi level 
of the electron system relative to the allowed energy bands. Depending on this 
position, the solid is an insulator or a conductor. 
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1. Free electrons enclosed in a box 


a. GROUND STATE OF AN ELECTRON GAS: FERMI ENERGY £; 


Consider a system of N electrons, whose mutual interactions we shall neglect, 
and which, furthermore, are subjected to no external potential. These N electrons, 
however, are enclosed in a box, which, for simplicity, we shall choose to be a cube 
with edges of length L. 

If the electrons cannot pass through the walls of the box, it is because the walls 
constitute practically infinite potential barriers. Since the potential energy of the 
electrons is zero inside the box, the problem is reduced to that of the three- 
dimensional infinite square well (cf. complements G, and H,). The stationary states 
of a particle in such a well are described by the wave functions: 


(r) = = ay n,—)\ sin ( n, ZË \ sin ( nn, (1-a) 
Pnxny.nz = L x L YL ZL 


nonan, = 1, 2, 3.. (1-b) 


[expression (1-a) is valid for 0 < x, y, z < L, since the wave function is zero outside 


this region]. The energy associated with q, , is equal to: 
242 
mh 2 2 2 
= no tn +n; 
"ansta nd? (n i =) (2) 


Of course, the electron spin must be taken into account: each of the wave 
functions (1) describes the spatial part of two distinct stationary states which differ 
by their spin orientation ; these two states correspond to the same energy, since the 
Hamiltonian of the problem is spin-independent. 

The set of these stationary states constitutes a discrete basis, enabling us to 
construct any state of an electron enclosed in this box (that is, whose wave function 
goes to zero at the walls). Note that, by increasing the dimensions of the box, 
we can make the interval between two consecutive individual energies as small as we 
wish, since this interval is inversely proportional to £?. If L is sufficiently large, 
therefore, we cannot, in practice, distinguish between the discrete spectrum (2) 
and a continuous spectrum containing all the positive values of the energy. 

The ground state of the system of the N independent electrons can be obtained 
by antisymmetrizing the tensor product of the N individual states associated with 
the lowest energies compatible with Pauli’s principle. If N is small, it is thus simple 
to fill the first individual levels (2) and to find the ground state of the system, as 
well as its degree of degeneracy and the antisymmetrized kets which correspond 
to it. However, when N is much larger than 1 (in a macroscopic solid, N is of the 
order of 1023), this method cannot be used in practice, and we must use a more 
global reasoning. 

We shall begin by evaluating the number n(£) of individual stationary states 
whose energies are lower than a given value E. To do so, we shall write expression (2) 
for the possible energies in the form: 

2 
-2g 6) 


NxN yħz 2m Nx My. Ne 
Mo 
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with: 
T 
CA A = nT 
T 
(kynn), = WE 
T 
(kp n,n) = NT (4) 


According to (1), a vector K, nyn, corresponds to each function @,ny.n (1). 
Conversely, to each of these vectors, there corresponds one and only one 
function Q,,...,.» The number of states n(E) can then be obtained by multiplying 


by 2 the number of vectors K, n,n, whose modulus is smaller than V 2m, E/h? (the 
factor 2 arises, of course, from the existence of electron spin). The tips of the 
vectors k, n,n, divide k-space into elementary cubes of edge z/L (see figure |, in 


which, for simplicity, a two-dimensional rather than a three-dimensional space is 


o, 
x x x x x x 
x x x x x x 
x x x x x x 
x x x x x x 
x x x x x x 
FIGURE Í 
n/L + x x x x x x 
Tips of the vectors k, „characterizing the 
i » (k), stationary wave functions in a two-dimen- 
n/L sional infinite square well, 


shown). Each of these tips is common to eight neighboring cubes, and each cube 
has eight corners. Consequently, if the elementary cubes are sufficiently small 
(that is, if L is sufficiently large). there can be considered to be one vector k, 
per volume element (z/L)* of k-space. 


ty z 


sphere is involved, since the components of k are positive [ cf: (1-b) and (4)]. If we 
divide it by the volume element (1/L)? associated with each stationary state, and 
if we take into account the factor 2 due to the spin, we obtain: 


4 32 ay 32 
Po eae all sees LS (5) 
83 \ p2 (LP 3? nm? 
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This result enables us to calculate immediately the maximal individual energy 
of an electron in the ground state of the system, that is, the Fermi energy E, of 
the electron gas. This energy E, satisfies: 


n(Es) =N (6) 


which gives: 


h? Ny" 
= — (3r? — 
Ee 2m, (: úl 5) (7) 


Note that, as might be expected, the Fermi energy depends only on the number N/L? 
of electrons per unit volume. At absolute zero, all the individual states of energy less 
than Ep are occupied, and all those whose energies are greater than E, are empty. 
We shall see in $ 1-b what happens at non-zero temperatures. 

We can also deduce the density of states p(E) from (5). p(E) dE is, by definition, 
the number of states whose energies are included between E and E + dE. This state 
density, as we shall see later, is of considerable physical importance. lt can be 
obtained simply by differentiating n(E) with respect to E: 


p(E) = dn(E) = e) E! (8) 


dE 2n* \ hi? 


p(E) therefore varies like JE. At absolute zero, the number of electrons with a 
given energy between E and E + dE (less than Ep, of course) is equal to p (E) dE. 
By using the value (7) of the Fermi energy Ep, we can put p(£) in the form: 


3, El? 
AE) => N= (9) 
F 
COMMENT: 


It can be seen from (5) that the dimensions of the box are involved only through 
the intermediary of the volume element (n/L)? associated, in k-space, with each stationary 
state. If. instead of choosing a cubic box of edge L, we had considered a parallelepiped 
of edges L,, L,, L}, we would have obtained a volume element of 27/L,L,L, : only 
the volume L, L L, of the box, therefore, enters into the density of states. This result can 
be shown to remain valid, whatever the exact form of the box, provided it is sufficiently 
large. 


b. IMPORTANCE OF THE ELECTRONS WITH ENERGIES CLOSE TO £; 


The results obtained in the preceding section make it possible to understand 
the physical properties of a free electron gas. We shall give two simple examples 
here, that of the specific heat and that of the magnetic susceptibility of the system. 
We shall confine ourselves, however, to semi-quantitative arguments which simply 
illustrate the fundamental importance of Pauli’s exclusion principle. 
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a. Specific heat 


At absolute zero, the electron gas is in its ground state: all the individual 
levels of energy less than E, are occupied, and all the others are empty. Taking 
into account the form (8) of the density of states p(E), we can represent the 
situation schematically as in figure 2-a: the number v(E) dE of electrons with an 
energy between E and E + dE is p(E) dE for E < Ep and zero for E > Ep. What 
happens if the temperature T is low but not strictly zero? 


A v(E) v(E) =k 


1 
13 
H 
4 
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| 
| 
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l 
i 
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FIGURE 2 


Variation of v(E) with respect to E [v(E)dE is the number of electrons with energy between E 
and E + dE]. At absolute zero, all the levels whose energies are less than the Fermi energy £, are 
occupied (fig. a). At a slightly higher temperature 7, the transition between empty and occupied 
levels occurs over an energy interval of a few kT (fig. b). 


If the electrons obeyed classical mechanics, each of them, in going from 
absolute zero to the temperature 7, would gain an energy of the order. of kT 
(where k is the Boltzmann constant). The total energy per unit volume of the electron 
gas would then be approximately : 


um =~ er (10) 


which would give a constant volume specific heat 0U,,/OT which is independent of 
the temperature. 

In reality, the physical phenomena are totally different, since Pauli’s principle 
prevents most of the electrons from gaining energy. For an electron whose initial 
energy E is much less than E, (more precisely, if E, — E > kT). the states to which 
it could go if its energy increased by AT are already occupied and are therefore 
forbidden to it. Only electrons having an initial energy E close to E, (Ep — E = kT) 
can “heat up”, as shown by figure 2-b. The number of these electrons is approxima- 
tely: 


AN ~ p(Ep) kT =2N= (11) 
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[according to (9)]. Since the energy of each one increases by about kT, the total 
energy per unit volume can be written: 
N kT 
U(T) = —— kT (12) 
L Ep 


instead of the classical expression, (10). Consequently, the constant volume specific 
heat is proportional to the absolute temperature T: 


A (13) 


For a metal, to which the free-electron model can be applied, E, is typically on the 
order of a few 2V. Since AT is about 0.03 eV at ordinary temperatures, we see that 
in this case the factor k T/Ep introduced by Pauli’s principle is of the order of 1/190. 


COMMENTS: 


(i) In order to calculate the specific heat of the electron gas quantitatively, we must know the 
probability / (E, T) for an individual state of energy E to be occupied when the system is 
at thermodynamic equilibrium at the temperature T. The number v(£) dE of electrons 
whose energies are included between E and E + dE is then: 


v(E) dE = f(E, T) p(E) dE (14) 
It is shown in statistical mechanics that, for fermions, the function f (E, T)can be written: 


1 
E, T) = —-——_- 15 
JET) eE-uWkT 4 4 (15) 


where y is the chemical potential, also called the Fermi level of the system. This is the 
Fermi-Dirac distribution. The Fermi level is determined by the condition that the total 
number of electrons must be equal to N: 


+o 
ABE y (16) 
eE- MKT +1 


p depends on the temperature, but it can be shown that it varies very slowly for small T. 
The shape of the function f(E, T) is shown in figure 3. At absolute zero, f(E, 0) is equal 
to l for E < pand to 0 for E > y (“step” function). At non-zero temperatures, f(E, T) 
has the form of a rounded “step” (the energy interval over which it varies is of the order 
of a few kT as long as kT < p). 

For a free electron gas, it is clear that the Fermi level at absolute zero coincides 
with the Fermi energy E, calculated in $1-a. According to (14) and the form that f(E, T) 
takes for T = 0 (fig. 3), u then characterizes, like Ep, the highest individual energy. 

On the other hand, for a system with a discrete spectrum of energy levels 
(E,, E), ..., E,,...), the Fermi level y obtained from formula (16) does not coincide with the 
highest individual energy Em in the ground state at absolute zero. In this case. the density 
of states is composed of a series of “delta functions” centered at E,, Ez, ..., Ej, ...; 
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A f(E, T) 


FIGURE 3 


Shape of the Fermi-Dirac distribution at 
absolute zero (dashed line) and at low 
temperatures (solid line). 

For an electron gas at absolute zero, 
the Fermi level y coincides with the Fermi 
energy Ep. The curves in figure 2 can be 
obtained by multiplying the density of 
states p(E) by f(E, T). 


consequently, at absolute zero, y can take on any value between E, and E,,, ,, Since, 
according to (14), all these possibilities lead to the same value of v(E). We choose to define y 
at absolute zero as the limit of (T) as T approaches zero. Since at non-zero temperatures 
the level E,, empties a little, and £,, , , begins to fill, the limit of (T) is found to be a value 
between E,, and E,,., (halfway between these two values if the two states E,, and E,,, 
have the same degree of degeneracy). 

Similarly, for a system containing a series of allowed energy bands separated by 
forbidden bands (electrons of a solid; cf. complement Fx), the Fermi level yw is in 
a forbidden band when the highest individual energy at absolute zero coincides with 
the upper limit of an allowed band. On the other hand, the Fermi level is equal to E, 
when E) falls in the middle of an allowed band. 


(ii) The preceding results explain the behavior of the specific heat of metals at very low 
temperatures. At ordinary temperatures, the specific heat is essentially due to vibrations 
of the ionic lattice (cf. complement L,), since that of the electron gas is practically 
negligible. However, the specific heat of the lattice approaches zero as T? for small 7. 
Therefore, that of the electron gas becomes preponderant at low temperatures (around 
1 °K), where, for metals, a decrease which is linear with respect to T is actually observed. 


B. Magnetic susceptibility 


Now suppose that a free electron gas is placed in a uniform magnetic field B 
parallel to Oz. The energy of an individual stationary state then depends on the 
corresponding spin state, since the Hamiltonian contains a paramagnetic spin 
term (cf. chap. IX, §A-2): 


W = -22 Bs, (17) 


where uz is the Bohr magneton: 


gh 


ome (18) 


Mz 


and S is the electron spin operator. For the sake of simplicity, we shall treat (17) 
as the only additional term in the Hamiltonian (the behavior of the spatial wave 
functions was studied in detail in complement E,,,). Under these conditions, the 
stationary states remain the same as in the absence of a magnetic field, and the 
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corresponding energy is increased or decreased by pB depending on the spin state. 
The densities of states p, (E}and p (E) corresponding respectively to the spin states 
| + > and | — > can therefore be obtained very simply from the density p(£) calcu- 
lated in St-a: 


| ? 
p+(E) = 5 P(E + gB) (19) 


Thus, at absolute zero, we arrive at the situation shown in figure 4. 


a p,(E) 


FIGURE 4 


The densities of states p, (E) and p (E) 
corresponding respectively to the spin 
states | + > and | — > (itg is negative). 
At absolute zero, only the states whose 
energies are less than Æ, are occupied. 


Since the magnetic energy |jt,/B is much smaller than Ep. the difference 
between the number of electrons whose spins are antiparallel to the magnetic field 
and the number whose spins are parallel to B is practically, at absolute zero: 


Las 
N_—N, => p(Ep)2 || B (20) 


The magnetic moment M per unit volume can therefore be written: 


ae 
M = al (N o— A +.) 
l s 
= på B HEN 21) 


This magnetic moment is proportional to the applied field. so that the magnetic 
susceptibility per unit volume is equal to: 


M yl : 
y =— = lg — ME) (22) 
Z B Lig E F 
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or, using expression (9) for p(E): 


3N uz 

A (23) 
21 Es 

COMMENTS: 


(i) We have assumed the system to be at absolute zero, but result (23) remains 
valid at low temperatures, since the modifications of the number of occupied 
states (fig. 2-b) are practically the same for both spin orientations. We therefore 
find a temperature-independent magnetic susceptibility. This is indeed what 
is observed for metals. 


(ii) Asin the preceding section. we see that the system behavior in the presence 
of a magnetic field is essentially determined by the electrons whose energies are 
close to E. This is another manifestation of Pauli’s principle. When the 
magnetic field is applied, the electrons in the | + > spin state tend to go into 
the | — > state, which is energetically more favorable. But most of them are 
prevented from doing so by the exclusion principle, since all the neighbouring 
| — > states are already occupied. 


c. PERIODIC BOUNDARY CONDITIONS 


o. Introduction 


The functions Pp n,n, given by formula (l-a) have a completely different 
structure from that of the plane waves e" which usually describe the stationary 
states of free electrons. This difference arises solely from the boundary conditions 
imposed by the walls of the box, since, inside the box, the plane waves satisfy the same 
equation as the Po mm. 


— — Ag(r) = E g(r) (24) 


The functions (l-a) are less convenient to handle than plane waves; this is why 
the latter are preferably used. To do so, we impose on the solutions of equation (24) 
new, artificial, boundary conditions which do not exclude plane waves. Of course, 
since these conditions are different from those actually created by the walls of the 
box, this changes the physical problem. However, we shall show in this section 
that we can find the most important physical properties of the initial system in this 
way. For this to be true, it is necessary for the new boundary conditions to lead 
to a discrete set of possible values of k such that: 


(i) The system of plane waves corresponding to these values of k constitutes 
a basis on which can be expanded any function whose domain is inside the box. 

(ii) The density of states p'(£) associated with this set of values of k is identical 
to the density of states p(£) calculated in $ l-a from the true stationary states. 
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Of course, the fact that the new boundary conditions are different from the 
real conditions means that the plane waves cannot correctly describe what happens 
near the walls (surface effects). However, it is clear that they can, because of condi- 
tion (ii), lead to a very simple explanation of the volume effects, which, according 
to what we have seen in $!-b. depend only on the density of states p(E). Moreover. 
because of condition (/). the motion of any wave packet far from the walls can be 
correctly described by superposing plane waves, since, between two collisions with 
the walls, the wave packet propagates freely. 


B. The Born-Von Karman conditions 


We shall no longer require the individual wave functions to go to zero at the 
walls of the box, bui. rather, to be periodic with a period L: 


g(x + L, v.z) = (x. y, z) (25) 


ik.r 


with analogous relations in y and z. Wave functions of the form e™*" satisfy these 


conditions if the components of the vector k satisfy: 


, 20 

k; = hT 

, 27 

k, = hT 

2n 
aT 26 
k= nT (26) 


where, now, 7, n, and n, are positive or negative integers or zero. We therefore 
introduce a new system of wave functions : 


, I jo ke tn tues) 

Prom E) am L 2 el E (27) 
which are normalized inside the volume of the box. The corresponding energy, 
according to (24), can be written: 


, =— (n +n? E (28) 


Any wave function defined inside the box can be extended into a periodic 
function in x, y, z, of period L. Since this periodic function can always be expanded 
in a Fourier series (cf. appendix I, $ 1-b), the { Pr; nzn, (r) } system constitutes a basis 
for wave functions with a domain inside the box. To each vector K,; n; n,» Whose 
components are given by (26), there corresponds a well-defined value of the energy 
En; ni n given by (28). Note, however, that the vectors k,,. ,,,,,,, can now have positive, 
negative or zero components, and that their tips divide space into elementary cubes 
whose edges are twice that found in $ l-a. 
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In order to show that boundary conduions (25) lead to the same physical 
results (as far as the volume effects are concerned) as those of $ l-a, it suffices to 
calculate the number n'(E) of stationary states of energy less than E, and find the 
value (5) [the Fermi energy E, and the density of states p(E) can be derived directly 
from 1(£)]. We evaluate n’(E) in the same way as in S!-a. taking into account the 
new characteristics of the vectors k,,. a;n; Since the components of k can now have 


arbitrary signs, the volume of the sphere of' radius Y 2m,E/h? must no longer be 
divided by 8. However, this modification ts compensated by the fact that the 
volume element (27/L)* associated with each of the states (27) is eight times larger 
than the one corresponding to the boundary conditions of $1-a. Consequently, n'(E) 
is the same as expression (5) for n(F). 

The periodic boundary conditions (25) therefore permit us to meet conditions (+) 
and (ii) of the preceding section. They are usually called the Born-Von Karman 
conditions (“ B.V.K. conditions”). 


COMMENT: 


Consider a truly free electron (not enclosed in a box). The eigenfunctions of the 
three components of the momentum P (and. consequently. those of the Hamiltonian 
H = P?:2m,) form a “continuous basis”: 


(4) f ¿por Al 29 
2nh E (29) 


We have already indicated several times that the states for which the form (29) is valid 
in all space are not physical states, but, can be used as mathematical intermediaries in 
studying the physical states, which are wave packets. 

We sometimes prefer to use the discrete basis (27) rather than the continuous 
basis (29). To do so, we consider the electron to be enclosed in a fictitious box of edge L. 
much larger than any dimension involved in the problem, and we impose the B.V.K. 
conditions. Any wave packet, which will always be inside the box for sufficiently large L, 
can be as well expanded on the discrete basis (27) as on the continuous basis (29). The 
states (27) can therefore, like the states (29). be considered to be intermediaries of the 
calculation; however, they present the advantage of being normalized inside the box. 
We must, of course, check, at the end of the calculations, that the various physical 
quantities obtained (transition probabilities. cross sections,...) do not depend on L, 
provided that £ is sufficiently large. 

Obviously. for a truly free electron, L has no physical meaning and can be 
arbitrary, as long as it is sufficiently large for the states (27) to form a basis on which the 
wave packets involved in the problem can be expanded [condition (/) of $ I-c-4]. On the 
other hand, in the physical problem which we are studying here, £* is the volume inside 
which the N electrons are actually confined and has, consequently, a definite value. 


2. Electrons in solids 
a. ALLOWED BANDS 


The model of free electron gas enclosed in a box can be applied rather well to 
the conduction electrons of a metal. These electrons can be considered to move freely 
inside the metal. the electrostatic attraction of the crystalline lattice preventing them 
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from escaping when they approach the surface of the metal. However. this model 
does not explain why some solids are good electrical conductors while others are 
insulators. This is a remarkable experimental fact : the electric properties of crystals 
are due to the electrons of the atoms of which they are composed: yet. the 
intrinsic conductivity can vary by a factor of 103% between a good insulator and a pure 
metal. We shall see. in a very qualitative way. how this can be explained by Pauli's 
principle and by the existence of energy bands arising from the periodic nature of the 
potential created by the ions (cf. complements O; and Fx). 

We showed in complement Fx, that if, in a first approximation. we consider 
the electrons of a solid to be independent. their possible individual energies are 
grouped into allowed bands, separated by forbidden bands. Assuming that each 
electron is subject to the influence of a linear chain of regularly spaced positive ions. 
we found. in the strong-bond approximation. a series of bands. each one contain- 
ing 2.1” levels. where 4° is the number of ions (the factor 2 arises from the spin). 

The situation, of course, is more complex in a real crystal, in which the positive 
ions occupy the nodes of a three-dimensional lattice. The theoretical understanding 
of the properties of a solid requires a detailed study of the energy bands, a study 
which is based on the spatial characteristics of the crystalline lattice. We shall not 
treat in detail these specific problems of solid state physics. We shall content our- 
selves with a qualitative discussion of the phenomena. 


b. POSITION OF THE FERMI LEVEL AND ELECTRIC CONDUCTIVITY 


Knowing the band structure and the number of states per band, we obtain the 
ground state of the electron system of a solid by successively * filling” the individual 
states of the various allowed bands, beginning, of course, with the lowest energies. 
The electron system is really in the ground state only at absolute zero. However, 
as we pointed out in $1-b-x, the characteristics of this ground state permit the semi- 
quantitative understanding of the behavior of the system at non-zero temperatures 

- often. up to ordinary temperatures. Like the thermal and magnetic properties 
(cf. S1-b). the electrical properties of the system are principally determined by the 
electrons whose individual energies are very close to the highest value Ep. If we 
place the solid in an electric field, an electron whose initial energy is much lower 
than E, cannot gain energy by being accelerated, since the states it would reach in 
this way are already occupied. It is therefore essential to know the position of E, 
relative to the allowed energy bands. 

First of all, we shall assume (fig. 5-a) that Ep falls in the middle of an allowed 
band. The Fermi level 4 is then equal to E, [ cf. comment (1) of $1-b-x |. The electrons 
whose energies are close to E, can easily be accelerated, in this case, since the 
slightly higher energy states are empty and accessible. Consequently, a solid for 
which the Fermi level falls in the middle of an allowed band is a conductor. The 
electrons with the highest energies then behave approximately like free particles. 

Consider, on the other hand, a solid for which the ground state is composed 
of entirely occupied allowed bands (fig. 5-b). Ep is then equal to the upper limit of 
an allowed band. and the Fermi level ¡e falls inside the adjacent forbidden band 
[ cf. comment (/) of $1-b-x]. In this case, no electrons can be accelerated, since the 
energy states immediately above theirs are forbidden. Therefore, a solid for which 
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the Fermi level falls inside a forbidden band is an insulator. The larger the interval AE 
between the last occupied band and the first empty allowed band. the better the 
insulator. We shall return to this point later. 


IAE oo 


FIGURE 5 


Schematic representation of the individual levels occupied by the electrons at absolute zero 
(in grey). E, is the highest individual energy. 

In a conductor (fig. a), 1: (which then coincides with the Fermi level x) falls inside an allowed band, 
called the “conduction band”. The electrons whose energies are near E, can then be accelerated 
easily, since the slightly higher energy states are accessible to them. 

In an insulator (fig. b), £, falls on the upper boundary of an allowed band called the “valence band ” 
(the Fermi level x is then situated in the adjacent forbidden band). The electrons can be excited 
only by crossing the forbidden band. This requires an energy at least equal to the width 4£ of 
this band. 


The deep allowed bands, completely occupied by electrons and, consequently, 
inert from an electrical and thermal point of view, are called valence bands. They 
are generally narrow. In a “strong-bond” model (¢/. complement Fy. $2). these 
bands arise from the atomic levels of lowest energies, which are only slightly affected 
by the presence oHthe other atoms in the crystal. On the other hand, the higher bands 
are wider; a partially occupied band is called a conduction band. 

For a solid to be a good insulator, the last occupied band must not only be 
entirely full in the ground state, but also, separated from the immediately higher 
allowed band by a sufficiently wide forbidden band. As we have indicated ($1-b-x), 
al non-zero temperatures, some states of energy lower than E, can empty. while 
some higher energy states fill (fig. 2-b). For the solid to remain an insulator at the 
temperature T, the width AE of the forbidden band which prevents this excitation 
of electrons must be much larger than A7. If AE is less than or of the order of kT. 
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a certain number of electrons leave the last valence band to occupy states of the 
immediately higher allowed band (which would be completely empty at absolute 
7ero). The crystal then possesses conduction electrons. but in restricted numbers: 
If is a semiconductor (such a semiconductor is called intrinsic: see comment below). 
For example. diamond. for which AF is close to 5 eV. remains an insulator at 
ordinary temperatures, while silicon and germanium. although quite similar to 
diamond, are semiconductors: their forbidden bands have a width AE less than | eV. 
These considerations, while very qualitative, enable us to understand why the 
electrical conductivity of a semiconductor increases very rapidly with the tempera- 
ture: with more quantitative arguments. we indeed find a dependence of the 
form e AT 

The properties of semiconductors also reveal an apparently paradoxical 
phenomenon. It is as if. in addition to the electrons which have crossed the 
forbidden band AE at a temperature /. there existed in the crystal an equal number 
of particles with a positive charge. These particles also contribute to the electric 
current, but their contribution to the Hall eflect*, for example. is opposite in sign 
to what would be expected for electrons. This can be explained very well by band 
theory, and constitutes a spectacular demonstration of Pauli's principle. To under- 
stand this qualitatively, we must recall that the last valence band. when it is comple- 
tely full in the vicinity of absolute zero, does not conduct any current (Pauli's 
principle forbids the corresponding electrons from being accelerated). When. by 
thermal excitation, certain electrons move into the conduction band, they free the 
states they had occupied tn the valence band. These empty states in an almost full 
band are called “holes”. Holes behave like particles of charge opposite to that of 
the electron. If an electric field is applied to the system, the electrons remaining 
in the valence band can move, without leaving this band, and occupy the empty 
states. In this way. they “fill holes” but also “leave new holes behind them”. 
Holes therefore move in the direction opposite to that of the electrons. that is. 
as if they had a positive charge. This very rough argument can be made more 
precise, and it can indeed be shown that holes are in every way equivalent to 
positive charge carriers. 


COMMENT! 


We have been speaking only of chemically pure and geometrically perfect crystals. 
However. in practice. all solids have imperfections and impurities. which often play an 
important role. particularly in semiconductors. Consider. for example. a quadrivalent 
silicon or germanium crystal, in which certain atoms are replaced by pentavalent impu- 
rity atoms, such as phosphorus. arsenic or antimony (this often happens. without any 
important change in the crystal structure). An atom of such an impurity possesses one 
too many outer electrons relative to the neighboring silicon or germanium atoms: 
it is called an electron donor. The binding energy .1£, of the additional electron is consi- 
derably lower in the crystal than in the free atom (it ts of the order of a few hundredths 
of an eV): this is due essentially to the large dielectric constant of the crystal. which 
reduces the Coulomb force (ef. complement Ayp $ f-a-6). The result is that the excess 
electrons brought in by the donor atoms Move More easily into the conduction band than 


* Recall what the Hall effect is: in a sample carrying a current and placed in a magnetic field 
perpendicular to this current. the moving charges are subject to the Lorentz force. In the steady state. 


this causes a transverse electric field to appear (perpendicular to the current and to the magnetic field). 
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do the “normal” electrons which occupy the valence band (fig. 6-a). The crystal thus 
becomes a conductor at a temperature much lower than would pure silicon or germanium. 
This conductivity due to impurities is called extrinsic. Analogously, a trivalent impurity 
(like boron, aluminium or gallium) behaves in silicon or germanium like an electron 
acceptor: it can easily capture a valence band electron (fig. 6-b), leaving a hole which can 
conduct the current. In a pure (intrinsic) semiconductor, the number of conduction 
electrons is always equal to the number of holes in the valence band. An extrinsic semi- 
conductor, on the other hand, can, depending on the relative proportion of donor and 
acceptor atoms, contain more conduction electrons than holes (it is then said to be of the 


Conduction 
band 
= 


AEn oaea rosas 
Boner level Forbidden A Ñ 
band Acceptar level RN 
tAE, 
Valence 
band 
a : typen b : type p 
FIGURE 6 


Extrinsic semiconductors : donor atoms (fig. a) bring in electrons which move easily into the 
conduction band, since their ground states are separated from it only by an energy interval AE, 
which is much smaller than the width of the forbidden band. Acceptor atoms (fig. b) easily capture 
valence band electrons, since, for this to happen, these electrons need only an excitation 
energy AE, which is much smaller than that needed to reach the conduction band. This process 
creates, in the valence band, holes which can conduct current. 


n-type, since the majority of charge carriers are negative). or more holes than conduction 
electrons (p-type semiconductors with a majority of positive charge carriers). These 
properties serve as the foundation of numerous technological applications (transistors, 
rectifiers, photoelectric cells, etc.). This is why impurities are often intentionally added 
to a semiconductor to modify its characteristics : this is called “doping”. 


References and suggestions for further reading: 
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For the solid state physics part, see Feynman III (1.2), chap. 14 and section 13 
of the bibliography. 
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1. Let Ay be the Hamiltonian of a particle. Assume that the operator A acts 


only on the orbital variables and has three equidistant levels of energies 0. hog. 
2hw, (where wọ is a real positive constant) which are non-degenerate in the orbital 
state space é, (in the total state space, the degeneracy of each of these levels is equal 
to 2s + 1, where s is the spin of the particle). From the point of view of the orbital 
variables, we are concerned only with the subspace of $, spanned by the three 
corresponding eigenstates of hy. 

a. Consider a system of three independent electrons whose Hamiltonian can 
be written: 


H = holl) + ho(2) + Ao (3) 


Find the energy levels of H and their degrees of degeneracy. 
b. Same question for a system of three identical bosons of spin 0. 


2. Consider a system of two identical bosons of spin s = | placed in the same 
central potential V(r). What are the spectral terms (cf. complement By,y. §2-b) 
corresponding to the Is”, 1s2p, 2p? configurations? 


3. Consider the state space of an electron, spanned by the two vectors | Po. >? 
and | p, > which represent two atomic orbitals, p, and p,, of wave functions ¢, (r) 
and p, (r) (cf. complement E, ¡¡. $2-b): 


Pp (£) = xf (r) = sin 8 cos orf (r) 
Pp, (T) = f(r) = sin O sin orf (r) 


a. Write, in terms of | p, > and |g, >. the state | y, > which represents 
the p, orbital pointing in the direction of the xOy plane which makes an angle x 
with Ox. 


b. Consider two electrons whose spins are both in the | + > state, the cigen- 
state of S, of eigenvalue + h/2. 

Write the normalized state vector | y > which represents the system of two 
electrons, one of which is in the state |“, > and the other. in the state Pr, >. 

c. Same question, with one of the electrons in the state | P, > and the other 
one in the state | Pp, >» Where « and f} are two arbitrary angles. Show that the state 
vector | y > obtained is the same. 

d. The system is in the state | y > of question b. Calculate the probability 
density P(r, 0, gr’, 0’, g’) of finding one electron at (r, O, p) and the other one at 
(r 0, p'). Show that the electronic density p(r. 0, p). [the probability density of 
finding any electron at (r, 0, p)] is symmetrical with respect to revolution about the 
Oz axis. Determine the probability density of having py — y” = Po. Where €, is 
given. Discuss the variation of this probability density with respect to po. 
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4. Collision between two identical particles 
The notation used is that of $D-2-a-B of chapter XIV. 


a. Consider two particles. (1) and (2), with the same mass m, assumed for the 
moment to have no spin and to be distinguishable. These two particles interact 
through a potential V(r) which depends only on the distance between them, r. 
At the initial time zo. the system is in the state | | : pe,: 2 : — pe, >. Let U(1, tọ) be 
the evolution operator of the system. The probability amplitude of finding it in the 
state | | : pn; 2 : — pn > at time 4, is: 


F(n)= (1 :m:2: — p | U(t, tọ) |1: pe,:2: — pe. > 


Let O and y be the polar angles of the unit vector n in a system of orthonormal 
axes Oxyz. Show that F(n) does not depend on q. Calculate in terms of F(n) the 
probability of finding any one of the particles (without specifying which one) with 
the momentum pn and the other one with the momentum — pn. What happens 
to this probability if O is changed to x — 0? 


b. Consider the same problem [with the same spin-independent interaction 
potential V(r)], but now with two identical particles. one of which is initially in the 
state | pe., m, >, and the other, in the state | — pe., m > (the quantum numbers m, 
and m< refer to the eigenvalues m, hand mh of the spin component along Oz). Assume 
that m, 4 m. Express in terms of F(n) the probability of finding, at time £,, one 
particle with momentum pn and spin m, and the other one with momentum — pn and 
spin m/. If the spins are not measured, what is the probability of finding one 
particle with momentum pn and the other one with momentum — pn? What happens 
to these probabilities when 0 is changed to z — 0? 


c. Treat problem / for the case m, = m;. In particular, examine the 0 = 7/2 
direction, distinguishing between two possibilities, depending on whether the 
particles are bosons or fermions. Show that, again, the scattering probability is the 
same in the 0 and z — 0 directions. 


5. Collision between two identical unpolarized particles 


Consider two identical particles, of spin s, which collide. Assume that their 
initial spin states are not known: each of the two particles has the same probability 
of being in the 2s + | possible orthogonal spin states. Show that, with the notation 
of the preceding exercise, the probability of observing scattering in the n direction is: 


E 
2s + | 


[F(n)|? + |F(— m? + [F*(n)F(— n) + c.c.] 


le = + | for bosons, — 1 for fermions). 
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6. Possible values of the relative angular momentum 
of two identical particles 


Consider a system of two identical particles interacting by means of a potential 
which depends only on their relative distance, so that the Hamiltonian of the system 
can be written: 


LPi 4 


q 2m | 2m 


+ V(R, — Ra]) 
As in $ B of chapter VII, we set: 


l 
R,=5(R, +R;) Po =P, +P, 


R =R, —R, P =>(P, — P.) 
H then becomes: 

H =H, + H, 
with: 


: 
a PG 
G 4m 


P2 
=a + V(R) 


r 


a. First, we assume that the two particles are identical bosons of zero spin 
(x mesons, for example). 
a. Weusethe { | r¿,r > } basis of the state space 4 of the system. composed 
of common eigenvectors of the observables Rg and R. Show that. if P,, is the 
permutation operator of the two particles: 


Pa | Igor» = |re -r> 


p. We now go to the [| p¿: Ep, 4. m > } basis of common eigenvectors 
of Pg, H,, L? and L, (L = R x P is the relative angular momentum of the two 
particles). Show that these new basis vectors are given by expressions of the form: 


1 , 
> Epdm > = —— | dir, eiPo ta? 
| Po z| , 


x [er Ra lr) Y7(0. p) | ror > 
Show that: 
Pa, | Po: E, h m> = (— 1)'| pg: E,.1.m> 
7. What values of / are allowed by the symmetrization postulate ? 
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b. The two particles under consideration are now identical fermions of 
spin 1/2 (electrons or protons). 

a. In the state space of the system, we first use the [ | ro, r; S, M > } basis of 
common eigenstates of R¿, R, S? and S,, where S = S, + S, is the total spin of 
the system (the kets | S, M > of the spin state space were determined in $B of 
chapter X). Show that: 


Pa let, M > = (— AP |ro = SS 


B. We now go to the { | pg: E,./.m:S, M > } basis of common eigenstates 
of P¿, H,, L?, L., S? and S.. 
As in question a- f$, show that: 


Pa, | Pa; En ms S.M > = (— DY "(= 1)'| pg: E, dm; S, M> 


y. Derive the values of / allowed by the symmetrization postulate for each 
of the values of S (triplet and singlet). 


c. (more difficult) 
Recall that the total scattering cross section in the center of mass system of two 
distinguishable particles interacting through the potential V(r) can be written: 


gza Y (21 + 1)sin? 6, 
k? ¡20 


where the 6, are the phase shifts associated with V(r) [cf. chap. VHI, formula (C-58)]. 
a. What happens if the measurement device is equally sensitive to both particles 
(the two particles have the same mass)? 
P. Show that, in the case envisaged in question a, the expression for c 
becomes: 


T las 


O ==> 
2 
k l even 


y. For two unpolarized identical fermions of spin 1/2 (the case of question b), 
prove that: 


2 de a Se 
ee Y (2) + 1)sin?6, +3 Y (21 + 1) sin? ò, 
k? leven l odd 
7. Position probability densities 


for a system of two identical particles 


Let |p > and |y > be two normalized orthogonal states belonging to the 
orbital state space $, of an electron, and let | + > and | — > be the two eigenvectors, 
in the spin state space 4,. of the S, component of its spin. 
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a. Consider a system of two electrons, one in the state | p, + > and the other, 
in the state | y, — >. Let pyy(r, r')d*rd*r" be the probability of finding one of them 
in a volume d?r centered at point r, and the other in a volume d?r’ centered at r' 
(two-particle density function). Similarly, let p,(r)d?r be the probability of finding 
one of the electrons in a volume d?r centered at point r (one-particule density 
function). Show that: 


| p(r)|? [x(r’) 
lør)? + |x) 


Il 


2 + lole)? |z)? 


2 


pult. r’) 
plr) 


Show that these expressions remain valid even if | p > and |z > are not 
orthogonal in 6.. 


Calculate the integrals over all space of p,(r) and plr. r'). Are they equal to |? 

Compare these results with those which would be obtained for a system of 
two distinguishable particles (both spin 1/2). one in the state | p. + > and the other 
in the state | y. — >: the device which measures their positions is assumed to be 
unable to distinguish between the two particles. 


h. Now assume that one electron is in the state | p. + > and the other one. 
in the state | 7. + >. Show that we then have: 


lo(r)z(r’) — ple) (r)]? 
lotr)? + |z(r)]* 


Pylr, r’) 
p(t) 


Calculate the integrals over all space of p,(r) and p,,(r. r’). 

What happens to p, and pu if | p > and | y > are no longer orthogonal in 6,? 

c. Same questions for two identical bosons, either in the same spin state or in 
two orthogonal spin states. 


8. The aim of this exercise is to demonstrate the following point : once the state 
vector of a system of N identical bosons (or fermions) has been suitably symmetrized 
(or antisymmetrized), it is not indispensable, in order to calculate the probability 
of any measurement result, to perform another symmetrization (or antisymmetri- 
zation) of the kets associated with the measurement. More precisely, provided 
that the state vector belongs to 46, (or £',), the physical predictions can be calculated 
as if we were confronted with a system of distinguishable particles studied by 
imperfect measurement devices unable to distinguish between them. 

Let | y > be the state vector of a system of N identical bosons (all of the 
following reasoning is equally valid for fermions). We have: 


S|\W>=|w> (1) 


L. 


a. Let |z > be the normalized physical ket associated with a measurement 
in which the N bosons are found to be in the different and orthonormal individual 
states | u, >, | tig >. ..., | u, >- Show that: 


lx) =VNIES| Lugs 2:up i Niu) (2) 


1451 


www.elsolucionario.net 


COMPLEMENT Dis 


b. Show that, because of the symmetry properties of | y >: 
Clair Now, >? = Kira prada ly 


where /, j, .... Lis an arbitrary permutation of the numbers 1, 2, ..., N. 


c. Show that the probability of finding the system in the state ED can be 
written: 


¡SAD 


NT: 2 Na yol 
y ite age lay (3) 


ido do 


where the summation is performed over all permutations of the numbers 1, 2, ..., N. 


d. Now assume that the particles are distinguishable, and that their state is 
described by the ket | y >. What would be the probability of finding any one of 
them in the state | u, >. another one in the state | 1, >, .... and the last one in the 
state | 11, >? 

Conclude, by comparison with the results of c, that, for identical particles, 
it is sufficient to apply the symmetrization postulate to the state vector | y > of the 
system. 

e. How would the preceding argument be modified if several of the individual 
states constituting the state | z > were identical? (For the sake of simplicity, 
consider only the case where N = 3). 


H. (more difficult) 


Now, consider the general case, in which the measurement result being 
considered is not necessarily defined by the specification of individual states, since 
the measurement may no longer be complete. According to the postulates of 
chapter XIV, we must proceed in the following way in order to calculate the 
corresponding probability : 

— first of all, we treat the particles as distinguishable, and we number them: 
their state space is then 6. Then let 6,, be the subspace of $ associated with the 
measurement result envisaged, with the measurement being performed with devices 
incapable of distinguishing between the particles : 

— with |y,, > denoting an arbitrary ket of 8,,, we construct the set of 
kets S | Ym > which constitutes a vector space £$ (&3 is the projection of &,, onto &s); 
if the dimension of 435 is greater than I, the measurement is not complete; 


— the desired probability is then equal to the square of the norm of the 
orthogonal projection of the ket | y > onto 63 which describes the state of the N 
identical particles. 


a. If P, is an arbitrary permutation operator of the N particles, show that, 
by construction of ó „: 


P, | Win > € Om 


Show that 4,, is globally invariant under the action of S and that £$ is simply the 
intersection of 6, and 6,,. 
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b. We construct an orthonormal basis in $,,: 
Lm >> | Pm Ps << [Pm >> [Om Ys | Om > 


the first k vectors of which constitute a basis of 63. Show that the kets S | p} > 
where k + 1 < n < p, must be linear combinations of the Bf! k vectors of this 
basis. Show, by taking their scalar products with the bras < o} |. < pg h = < ot |. 
that these kets S | p" > (with n > k + 1) are necessarily zero. 


c. Show from the preceding results that the symmetric nature of | y > implies 
that: 


p k 
3 Ko |W >)? = 2 Con yor 
that is: 


Cw Ph lw > = <y | Pal y> 


where P$ and P,, denote respectively the projectors onto 8% and 6,,. 

Conclusion: The probabilities of the measurement results can be calculated 
from the projection of the ket | y > (belonging to 6,) onto an eigensubspace ém 
whose kets do not all belong to 4,. but in which all the particles play equivalent 
roles. 


9. One- and two-particle density functions 
in a electron gas at absolute zero 


I. 


a. Consider a system of N particles 1, 2, .... i .... N with the same spin s. 
First of all, assume that they are not identical. In the state space é (i) of particle (7). 
the ket | į : rọ, m > represents a state in which particle (1) is localized at the point r, 
in the spin state | m > (mh: the eigenvalue of S_). > 

Consider the operator: 


= Y [isrom> <i: rome ia} 


where /(/) is the identity operator in the space &(/). 

Let | y > be the state of the N-particle system. Show that < y | F,,(ro) | > dt 
represents the probability of finding any one of the particles in the infinitesimal 
volume element dz centered at ry. the component of its spin being equal to mm. 

b. Consider the operator: 


N 
Gmmlto ty) = Y Y e litrgmig tegen’ lirna] || w} 


i=l jži kij 


What is the physical meaning of the quantity < Y | Gmm (Fo. ta) | y > dt dr”. 
where dt and dt’ are infinitesimal volumes ? 
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The mean values < y | F,, (Io) | Y > and < y | Gmm (Fo. ro) | y > will be written, 
respectively, pl (r,) and pl.(ro, ro) and will be called the one- and two-particle 
density functions of the N-particle system. 

The preceding expressions remain valid when the particles are identical. 
provided that | y > is the suitably symmetrized or antisymmetrized state vector of 
the system (cf. preceding exercise). 


II. 
Consider a system of N particles in the normalized and orthogonal individual 


states | u, >, | uz >... | uy >. 
The normalized state vector of the system is: 


Iw =JSNIT|I ie? EN 2 ily 


where 7 is the symmetrizer for bosons and the antisymmetrizer for fermions. 
In this part, we want to calculate the mean values in the state | y > of symmetric 
one-particle operators of the type: 


i=l j+i 


N 
F=) < fi @ I] 10 } 
or of symmetric two-particle operators of the type: 


c= È y {anes 10 | 


i=l j#i kei 
a. Show that: 
CW) Fl yy = Clty 2iugsiN sw || Da. | 
PUN Oly 
where e, = + I for bosons,and + l or — 1 for fermions, depending on whether 


the permutation P, is even or odd. 
Show that the same expression is valid for the operator G. 


b. Derive the relations: 


Gu FI) = Y <i Oliu 


R 
<yY|G|y> z 2 2 (Citas jru;| gif iu jo) 
i=1 jæi 
+a[iiupzjiu | ws) SOS 
with £ = + 1 for bosons, £ = — 1 for fermions. 
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HI. 


We now want to apply the results of part II to the operators F„(rọ) and 

G mm (fo. ró) introduced in part I. The physical system under study is a gas "of N free 
electrons enclosed in a cubic box of edge L at absolute zero (complement Cy ¡y. $ 1). 
By applying periodic boundary conditions, we obtain individual states of the 
form | Pk? | + >. where the wave function associated with |x > is a plane 


ik.r 


„and the components of k satisfy relations (26) of complement Cy ¡y. 


We shall call E, = hi*k7¿2m the Fermi energy of the system and 4, = 22/k,. the 
Fermi wavelength. 

a. Show that the two one-particle density functions p! (rọ) and p" (rọ) are 
both equal to: 


p' (ro) = p' (ro) = 3 | Plro)|? 


where the summation over k is performed over all values of k of modulus less 
then A p, satisfying the periodic boundary conditions. By usings | of complement C,,,. 
show that p'(r,) =p! (ry) = kp/6n? = N/2L*. Could this result have been 
predicted simply ? 

b. Show that the two two-particle density functions p'! (ry. ró) and 
p"! , (ro. ró) are both equal to : 


Y Y | Malte) pr dro)? = N? ALS 
k k’ 


where the summations over k and k’ are defined as above. Give a physical inter- 
pretation. 

c. Finally, consider the two two-particle density functions p"! , (rọ. r4) and 
p" (ro. ro). Prove that they are both equal to: 


y y { [rl o) MARS)? = PET) plo) Alro) Py Ato) } 
k kk 


Show that the restriction k’ 4 k can be omitted, and show that the two two-particle 
density functions are equal to: 


-Xn — C?(kpd)] 


4L pre lx X Pla) Plo) 
with d = |ro — ra|. where the function C(x) is defined by: 
i OY shes 
C(x) = — [sin x — x cos x] 
x" 


(x can be replaced by an integral over k). 
k 


How do the two-particle density functions p!! (rọ. rg) and p'! (rọ. rg) vary 
with respect to the distance d between rọ and rọ? Show that it is practically 
impossible to find two electrons with the same spin separated by a distance much 
smaller than 4,. 
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Appendix | 
FOURIER SERIES AND FOURIER TRANSFORMS 


1. Fourier series 
a. Periodic functions 
b. Expansion of a periodic function in a Fourier series 
c. The Bessel-Parseval relation 
2. Fourier transforms 
a. Definitions 
b. Simple properties 
e. The Parseval-Plancherel formula 
d. Examples 
e 


. Fourier transforms in three-dimensional space 


In this appendix. we shall review a certain number of definitions, formulas and 
properties which are useful in quantum mechanics. We do not intend to enter into 
the details of the derivations, nor shall we give rigorous proofs of the mathematical 
theorems. 


1. Fourier series 


a. PERIODIC FUNCTIONS 


A function f(x) of a variable is said to be periodic if there exists a real 
non-zero number L such that, for all v: 
[fe + L) = s(x) | (1) 


L is called the period of the function f (x). 

If f(x) is periodic with a period of L, all numbers nL, where n is a positive or 
negative integer. are also periods of f(x). The fundamental period Ly of such a 
function is defined as being its smallest positive period (the term “period” is often 
used in physics to denote what is actually the fundamental period of a function). 


COMMENT: 


We can take a function f(x) defined only on a finite interval [«. b] 
of the real axis and construct a function f,(x) which is equal to f(x) 
inside [«, h] and is periodic, with a period (b — a). f,(x) is continuous if f(x) 
is and if : 


f(b) = f(a) (2) 
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We know tha: the trigonometric functions are periodic. In particular: 


x a x 

cos 27 and sin ant (3) 
have fundamental periods equal to L. 

Other particularly important examples of periodic functions are the periodic 
exponentials. For an exponential e** to have a period of L, it is necessary and 
sufficient, according to definition (1), that: 


ett = 1 (4) 
that is: 
aL = 2inn (5) 


where n is an integer. There are therefore two exponentials of fundamental 
period L: 


et nr (6) 
which are, furthermore, related to the trigonometric functions (3) which have the 


same period: 


+ 2in= x : x 
L = Lee 2 
cos 27 z + isin 2n T (7) 


The exponential e””*L also has a period of L, but its fundamental period is L/n. 


b. EXPANSION OF A PERIODIC FUNCTION IN A FOURIER SERIES 


Let f(x) be a periodic function with a fundamental period of L. If it satisfies 
certain mathematical conditions (as is practically always the case in physics), it 
can be expanded in a series of imaginary exponentials or trigonometric functions. 


a. Series of imaginary exponentials 


We can write /(x) in the form: 


| tx 


| foy= E pe (8) 
with: 
2 
k, = n= (9) 


The coefficients c, of the Fourier series (8) are given by the formula: 


1 xo+L 
jo [ dx ne f(x) (10) 


0 


where x, is an arbitrary real number. 
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To prove (10), we multiply (8) by e” “** and integrate between xo and x9 + L: 


xo +L +00 xo+L . 
f dxe“ "f(x = Y c, dx efn kx (11) 


xo n=- 00 xo 


The integral of the right-hand side is zero for n # p and equal to L forn = p. Hence formula (10). 
It can easily be shown that the value obtained for c, is independent of the number x, chosen. 


The set of values |c,| is called the Fourier spectrum of f(x). Note that f(x) 
is real if and only if: 


Co, = (12) 


B. Cosine and sine series 


If, in the series (8), we group the terms corresponding to opposite values 
of n, we obtain: 


f(x) = co + Y (e, eto +e., 07%) (13) 


that is, according to (7): 


f(x) =a) + > (a, cos k,x + b, sin k,x) (14) 


n=1 


(15) 


a Q 
= = 
oll 
Qu 
= 
+ 
o 
n | 
| 3 
= 
N 
= 
V 
© 


The formulas giving the coefficients a, and b, can therefore be derived from (10): 


1 xo+L 
Ay -zf dx f(x) 
2 oan 
a, = al dx f(x) cos k„x 
2 on 
ba -4f dx f(x) sin k,x (16) 


If f(x) has a definite parity, expansion (14) is particularly convenient, 
since: 


b,=0 if f(x) is even 
a, = 9 if f(x) is odd (17) 


Moreover, if f(x) is real, the coefficients a, and 6, are real. 
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c. THE BESSEL-PARSEVAL RELATION 


It can easily be shown from the Fourier series (8) that: 


— = 


i xo +L + x 
ot | alo E e (18) 


This can be shown using equation (8): 


1 xo+L xo+L 
al dx | f(x)? = Y che, al dx efn- kps (19) 
np 


xo xo 


As in (11), the integral of the right-hand side is equal to Lô„p. This proves (18). 


When expansion (14) is used, the Bessel-Parseval relation (18) can also be 
written: 


xo+L 
1 ' [E 
a | dx |)? = lag)? +3 E Clay + b, (20) 


oO 


If we have two functions, f(x) and g(x), with the same period L, whose 
Fourier coefficients are, respectively. c, and d,, we can generalize relation (18) 
to the form: 


1 Xo +L +a 

a | agost) = Y dec, (21) 
2. Fourier transforms 
a. DEFINITIONS 
A. The Fourier integral as the limit of a Fourier series 


Now, consider a function f(x) which is not necessarily periodic. We 
define f(x) to be the periodic function of period L which is equal to f(x) inside 
the interval [ — L/2, L/2]. f(x) can be expanded in a Fourier series: 


+a 


Ls 0 e (22) 


n=-x 


where k, is defined by formula (9), and: 


Xoth 
l —ikyx y l £ TÚ nx f 
7 | dx e7" f(x) = al dx ex f(x) (23) 


0 


When L approaches infinity, f,(x) becomes the same as f(x). We shall therefore 
let L approach infinity in the expressions above. 


1461 


www.elsolucionario.net 


APPENDIX | 


Definition (9) of k, then yields: 


2n 
n+l kn = L (24) 
We shall now replace 1/L by its expression in terms of (k,,, — k,) in (23) and 
substitute this value of ¢, into the series (22): 
L 
: € kn = k, iknx E -i i 
fis) =X O IS (25) 


2 
When L —=> œ, k,,, — k, approaches zero [cf. (24)], so that the sum over n is 
transformed into a definite integral. f, (x) approaches f(x). The integral appearing 


in (25) becomes a function of the continuous variable k. If we set: 


+x 


fik) = sz | dx e~ * f(x) (26) 


EX 


relation (25) can be written in the limit of infinite L: 


id= = | dk is f(x) 27) 


f(x) and f(k) are called Fourier transforms of each other. 


B. Fourier transforms in quantum mechanics 


In quantum mechanics, we actually use a slightly different convention. 
If w(x) is a (one-dimensional) wave function, its Fourier transform y(p) is defined 
by: 


Wp) = l | dx enirn y(x) (28) 


TE 
V 2rh 


and the inverse formula is: 


— — 
| Y 21h 


| dp e'?*" Wp) | (29) 


To go from (26) and (27) to (28) and (29), we set: 
p =hk (30) 


(p has the dimensions of a momentum if x is a length), and: 


-5(3) (31) 
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In this appendix, as is usual in quantum mechanics. we shall use definition (28) 
of the Fourier transform instead of the traditional definition. (26). To return to the 
latter definition. furthermore. all we need to do is replace fi by 1 and p by & in all 
the following expressions. 


b. SIMPLE PROPERTIES 


We shall state (28) and (29) in the condensed notation: 


vv) = FL] (32-a) 
wiv) = F[w(p)] (32-b) 
The following properties can easily be demonstrated: 
() Wp) = FTW) => Wp — po) = Fel" yey] (33) 


erat) = F [h(x — xo) 


This follows directly from definition (28). 


a _ e lo 
(i) Wp) =F WI] => F De] q Y (2) (34) 
To see this, all we need to do is change the integration variable : 

Ut = CX (35) 
In particular: 

Fiyi- x)] = Wp) (36) 
Therefore, if the function y(x) has a definite parity, its Fourier transform has the 
same parity. 

(iii) W(x) real <> (wip) ]* =ý- p} (37-a) 

w(x)pureimaginary <= [yw(p)|* = — w(— p) (37-b) 
The same expressions are valid if the functions y and y are inverted. 

(iv) If f denotes the ath derivative of the function f. successive differentiations 
inside the summation yield, according to (28) and (29): 


FLW") = (X) Hp) (38-a) 


gun (2) vo 


(v) The convolution of two functions y(x) and w(x) ts. by definition. the 
function W(x) equal to: 


(38-b) 


Wx) = | dy WiC) aly — 1) (39) 
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Its Fourier transform is proportional to the ordinary product of the transforms 
of w,(x) and Y, (x): 


Ho) = \/2ah W,(p) Yalp) (40) 


This can be shown as follows. 
We take the Fourier transform of expression (39) : 


E 1 + 00 + 
Wip) = | dx aos dy yi ly) yax — y) (41) 
2nh Y-a = 


20 


and perform the change of integration variables: 


(y = fw x 9) (42) 


If we multiply and divide by e'”"!* we obtain: 


= 1 +0 + xu 
ee 5l dy e Pty (y) | du ete y (u) (43) 
T -20 


which proves (40). 


= 00 


(vi) When W(x) is a peaked function of width Ax, the width Ap of (p) satisfies: 

Ax. Ap>h (44) 
(see $ C-2 of chapter I, where this inequality is analyzed. and complement Cy). 
c. THE PARSEVAL-PLANCHEREL FORMULA 


A function and its Fourier transform have the same norm: 


| | as W(x)? = I E Lp)? (45) 


vu — xXx 


To prove this, all we need to do is use (28) and (29) in the following way: 


| dx |y(x)|? = | dx y*(x) =] dp e'P*!* (p) 


se = 2nh 


- | dp Wp) all dx e'P=ih y*(x) 
- 0 TU =y 


= f dp Y*(p) V (p) (46) 


00 


As in $ 1-c, the Parseval-Plancherel formula can be generalized: 


| +x e _ 
3 | dx 0 *(x) y(x) = | dp 0*(p) yp) | (47) 


= OE 
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d. EXAMPLES 


We shall confine ourselves to three examples of Fourier transforms, for which 
the calculations are straightforward. 


(¿) Square function 


— l a 
y(x) == for -23<x<>3 
a 2 2 a 1 sin poi) (48) 
=0 for |x| >5 V2nh Pal 
(ii) Decreasing exponential 
= 2 la 

Jagr E A 49 

W(x) =e <> vp) amas (49) 


(iii) Gaussian function 


V(x) =e 8? = Hp) = — 


J 2h 


e” p?a? 4h? (50) 


(note the remarkable fact that the Gaussian form is conserved by the Fourier 
transform). 


COMMENT: 
In each of these three cases, the widths Ax and Ap can be defined for y(x) 


and w(p) respectively. and they verify inequality (44). 


e. FOURIER TRANSFORMS IN THREE-DIMENSIONAL SPACE 


For wave functions y(r) which depend on the three spatial variables x, y, z, 
(28) and (29) are replaced by: 


v(p) = 


fur e PFA lr) (51-a) 


(27h)? 


= d? ipri p $ 
r sae | perr" (p) (51-b) 


The properties stated above (382-b and 2-c) can easily be generalized to three 
dimensions. 

If y depends only on the modulus r of the radius-vector r, y depends only on 
the modulus p of the momentum p and can be calculated from the expression : 


1) ee | ‘Pai sin 2” y(r) (52) 
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First, we shall find using (51-a) the value of y for a vector p’ obtained from p by an 
arbitrary rotation 4: 


p = ?p (53) 


ar = dor —ip' rh r 
v(P’) a e y(r) (54) 


In this integral, we replace the variable r by r' and set: 


r= 2r (55) 
Since the volume element is conserved under rotation : 

Pr = dr (56) 
In addition, the function y is unchanged, since the modulus of r' remains equal to r; finally: 

p.r=p.r (57) 
since the scalar product is rotation-invariant. We thus find: 

lp) = Yp) (58) 


that is, y depends only on the modulus of p and not on its direction. 
We can then choose p along Oz to evaluate pp): 


2 1 
Wp) = Onn? 


l l rae ial ipr cos 
= a | r dr y(r) | de | dé sinte > 
(22h)? Jy 


1 Ñ 2h 
= —_| r? dr Wr) 22a — sin s 
0 


dr ec ir h y(r) 


(2nh)>'? pr 
1 27 - 
ue zf rdr y(r) sin Z (59) 
Vah P Jo h 


This proves (52). 


References and suggestions for further reading: 
Sec, for example. Artken (10.4). chaps. l4 and 15, or Butkov (10.8). chaps. 4 and 7: 


Bass (10.1). vol. I, chaps. XVIH through XX: section 10 of the bibliography. especially 
the subsection “ Fourier transforms: distributions”. 
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Appendix Il 
THE DIRAC ô-” FUNCTION” 


1. Introduction: principal properties 
a. Introduction of the 6-" function” 
b. Functions which approach 6 
c. Properties of ò 

2. The 0-“ function” and the Fourier transform 
a. The Fourier transform of 6 
b. Applications 

3. Integral and derivatives of the d-* function” 
a. ô as the derivative of the “unit step-function” 
b. Derivatives of ò 

4. The d-” function” in three-dimensional space 


The ô-“ function” is actually a distribution. However. like most physicists. we shall 
treat it like an ordinary function. This approach, although not mathematically 
rigorous, is sufficient for quantum mechanical applications. 


1. Introduction; principal properties 


a. INTRODUCTION OF THE 6-“FUNCTION” 


Consider the function 9“ (x) given by (cf. fig. 1): 


Ax) 
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The function ¿'*(x): a square function of 
width £ and height 1/2, centered at x = 0. 
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where s is a positive number, We shall evaluate the integral: 


| dx 5 (x) f(x) (2) 


where / (x) is an arbitrary function, well-defined for x = 0. IF ¢ 1s sufficiently small. 
the variation of f(x) over the effective integration interval [ — ¢/2, ¢/2] is negligible, 
and f(x) remains practically equal to £(0). Therefore: 


| dx d(x) f(x) = rof dx d(x) = f(0) (3) 


The smaller £, the better the approximation. We therefore examine the limit e = 0 
and define the 6-“function” by the relation: 


| dx a(x) f(x) = 100) (4) 


x 


which is valid for any function f(x) defined at the origin. More generally, d(x — xo) 
is defined by: 


o | AE (5) 


COMMENTS: 


(i) Actually, the integral notation in (5) is not mathematically justified. 6 is 
defined rigorously not as a function but as a distribution. Physically, this 
distinction is not an essential one as it becomes impossible to distinguish 
between ¿'(x) and 6(x) as soon as e becomes negligible compared to all the 
distances involved in a given physical problem*: any function /(x) which 
we might have to consider does not vary significantly over an interval of 
length e. Whenever a mathematical difficulty might arise, all we need to do is 
assume that 0(x) is actually 9 (x) [or an analogous but more regular function, 
for example, one of those given in (7), (8), (9), (10), (11) ], with £ extremely small 
but not strictly zero. 

(ii) For arbitrary integration limits a and b, we have: 


| dx 6(x) f(x) = f(0) if Oe[a,b] 
i =0 if O¢[a,b] (6) 


* The accuracy of present-day physical measurements does not, in any case, allow us to 
investigate phenomena on a scale of less than a fraction of a Fermi (1 Fermi = 107 '% m). 
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b. FUNCTIONS WHICH APPROACH ð 

It can easily be shown that, in addition to 9'(x) defined by (1), the following 
functions approach ò(x). that is, satisfy (5). when the parameter £ approaches zero 
from the positive side: 


(i) ae (7) 
a t- (8) 
(iii) Pl (9) 
iy = SB) (10) 
wy = Saws) 


We shall also mention an identity which is often useful in quantum mechanics 
(particularly in collision theory): 


Lim - 
e>0, NEE 


= Po + ind(x) (12) 


where ¥ denotes the Cauchy principal part, defined by* [ / (x) is a function which 
is regular at x = 0]: 


al E fa = Lim i a Esos A.B>0 (13) 


To prove (12), we separate the real and imaginary parts of l/(x + i): 


| x + fe 
ERE (14) 
xie x*+el 
Since the imaginary part is proportional to the function (8). we have : 
y E n 
Lim + i—— = F in òx) (15) 


=D. ES +e 
As for the real part, we shall multiply it by a function f(x) which is regular at the origin and 
integrate over xv: 


Lim | 2 0) = Lim Lim | IM an an iS acs > f(x) (16) 
r=0. Jo x + q? £70, yO 


* p often uses one of the following relations : 


an E TQ) -Í a +] at 


tB Tey A 
= | Es 0! - 40) + /(0) Log” 


A 


where f_(x) = [/f(x)— f(— x)]2 is the odd part of /(v). These formulas allow us to explicitls 
eliminate the divergence at the origin. 
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The second integral is zero: 


+n 
xd: 1 : 
Lim | “O f(x) = f(0) Lim = [Log (x? + a?)]" =0 (17) 
mo, Jo oF e n70. 2 f 


If we now reverse the order of the evaluation of the limits in (16), the ¢ —> 0 limit presents no 
difficulties in the two other integrals. Thus : 


Lim | = ie f + | [Ew (18) 
nO Jo x te? mO-LJ_y +n = 


This establishes identity (12). 


c. PROPERTIES OF 6 


The properties we shall now state can be demonstrated using (5). Multiplying 
both sides of the equations below by a function /(x) and integrating, we see that 
the results obtained are indeed equal. 


(i) = d(— x) = A(x) (19) 
i: Ae Se La (20) 


le 


and, more generally : 


d[y(x)| = Y ~~ (x — xX, 21 
where g(x) is the derivative of g(x)and the x are the simple zeros of the function g(x): 

g(x) = 0 

g'(x) #9 (22) 


The summation is performed over all the simple zeros of g(x). If g(x) has multiple 
zeros [that is, for which g’(x,) is zero], the expression d[g(x)] makes no sense. 


(iii) x d(x — xo) = xo (xX — Xo) (23) 
and, in particular: 

x d(x) = 0 (24) 
The converse is also true and it can be shown that the equation : 

x u(x) = 0 (25) 
has the general solution : 

u(x) = c d(x) (26) 


where c is an arbitrary constant. 
More generally : 


g(x) d(x — xo) = g(x) H&K — xo) (27) 
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(iv) | dx d(x — y) d(x — z) = Ó(y — z) (28) 


Equation (28) can be understood by examining functions 6'*(x) like the one shown in 
figure 1. The integral: 


Fy, z) = f dx Mx — y) ô® (x — 2) (29) 


— 0 


is zero as long as |y — z| > e, that is, as long as the two square functions do not overlap (fig. 2). 


ôx — y) ôx — z) 
Es «E, PA 


— FIGURE 2 


— 


m 


The functions ġ®’(x — y) and ô®’'(x — -): 

two square functions of width £ and height 

l/e, centered respectively at x = v and 
X x=, 


The maximum value of the integral, obtained for y = z, is equal to 1/2. Between this maximum 
value and 0, the variation of Fy, z) with respect to y — z is linear (fig. 3). We see immediately 
that FO (y, z) approaches 6(y — z) when £e —= 0. 


Fy, 2) 


— 


FIGURE 3 


m 


The variation with respect to v — 2 of the 
scalar product FX y, z) of the two square 
functions shown in figure 2. This scalar 
product is zero when the two functions do 
not overlap (|y — =| > €), and maximal 


i e when they coincide. F“)(), z) approaches 
ZE +e y=z Óó(y — 2) when ¿ —~ 0. 


COMMENT: 


A sum of regularly spaced 6-functions: 


> d(x — qL) (30) 


q=-0 


1472 


www.elsolucionario.net 
THE DIRAC DELTA-FUNCTION 


can be considered to be a periodic “function” of period L. By applying 
formulas (8), (9) and (10) of appendix I, we can write it in the form: 


+ xu 1 +a nx 


L ôx- aL) =z Y eT (31) 


q= -0 n ee 


2. The ô-” function” and the Fourier transform 


a. THE FOURIER TRANSFORM OF ô 


Definition (28) of appendix I and equation (5) enable us to calculate directly 
the Fourier transform 6, (p) of d(x — xp): 


Xo 


1 = 
ral. 


In particular, that of ó(x) is a constant: 


9D) 


dx e P= S(x — xo) = — eos (32) 
y 2nh 


õolP) == (33) 
v 21h 
The inverse Fourier transform [formula (29) of appendix I] then yields: 
1 ie i ii A | 
ò a is ip(x—xo)/h — 2 ik(x—xo) | 
| O(x — Xo) aah | dpe 7z B dk e | (34) 


Y — 00 


This result can also be found by using the function 6'(x) defined by (1) or any of the 
functions given in $ 1-b. For example, (48) of appendix I enables us to write: 
+ 4 


öx) = E dp eipsih sin (pe/2h) 


35 
2nh J pe/2h (23) 


If we let ¢ approach zero, we indeed obtain (34). 


b. APPLICATIONS 


Expression (34) for the $-function is often very convenient. We shall show, 
for example, how it simplifies finding the inverse Fourier transform and the 
Parseval-Plancherel relation [formulas (29) and (45) of appendix 1]. 

Starting with : 


— l 
p) == 
Vv 2nh 


| “dx eiri W(x) (36) 
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we calculate: 


dpe" amo] aperea (7 


al 


In the second integral, we recognize ó(x — £), so that: 


R | dp e!» ip) = | dé WE) dx — E) = veo) (38) 


v 2nh — 20 =a 


which is the inversion formula of the Fourier transform. 
Similarly : 


Wo) = =| dx cir? (x) | dx’ eH" yx) (39) 


— 00 — 00 


If we integrate this expression over p, we find: 


[ air =3 erof 


xX 


+x 


dx’ W(x’) | dp et? xh (40) 


uae ss 


+x 
— 20 


that is, according to (34): 


| dp (Wo)? = | dx bo i dx'y(x’) d(x — x’) = | dx lx)? 
-0 = (41) 


=- 0 — x 


which is none other than the Parseval-Plancherel formula. 
We can obtain the Fourier transform of a convolution product in an analogous 
way [cf. formulas (39) and (40) of appendix I]. 


3. Integral and derivatives of the 6-“function” 


a. ô IS THE DERIVATIVE OF THE “UNIT STEP-FUNCTION” 
We shall evaluate the integral: 
x 
B(x) = | Mx”) dx’ (42) 
— 00 


where the function 6‘(x) is defined in (1). It can easily be seen that 0'(x) is 


E E 1 E E E 
S =>; ZZ = 5 -5 = xX SF 
equal to 0 for x 5 to 1 for x > and 107 + 5) for > <x > The 


variation of 0'*(x) with respect to x is shown in figure 4. When e — 0, 0(x) 
approaches the Heaviside “step-function” 0(x), which, by definition, is equal to: 


A(x) = 1 if x >0 
A(x) = 0 if x <0 (43) 
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A G(x) 


FIGURE 4 


Variation of the function 9% (x), whose 
derivative 5“)(x) is shown in figure 1. 
When s£ —=> 0, 6(x) approaches 
the Heaviside step-function 0(x). 


| 
Nl 
p e i 
Nolo 
x 


ô®(x) is the derivative of 0“(x). By considering the limit e —» 0, we see 
that d(x) is the derivative of 0(x): 
d 


dx A(x) = (x) (44) 


Now, consider a function g(x) which has a discontinuity o, at x = 0: 


Lim g(x) — Lim g(x) = c (45) 


x>0+ = 


Such a function can be written in the form g(x) = g,(x)0(x) + g,(xJ0(— x), where 
g,(x) and g,(x) are continuous functions which satisfy g,(0) — g,(0) = oo. If we 
differentiate this expression, using (44), we obtain: 


g(x) Ox) + go (x) O(— x) + g,(x)0(x) — ga(x)ó(— x) 
g(x) A(x) + ga(x) O(— x) + 0, d(x) (46) 


g'(x) 


according to properties (19) and (27) of 6. For a discontinuous function, there is 
then added to the ordinary derivative [the first two terms of (46)] a term proportional 
to the ó-function, the proportionality coefficient being the magnitude of the func- 
tion's discontinuity *. 


COMMENT: 


The Fourier transform of the step-function 0(k) can be found simply 
from (12). We get: 


+20 W . 
| 6(k) e** dk = Lim | dk eli) = Lim — = iP + nd(x) (47) 
-£ 0 


£7 04 270, X + ig 


* Of course, if the function is discontinuous at x = Xo, the additional term is of the form : 
[gi (xo) — gz2(xo)lð(x — xo). 
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b. DERIVATIVES OF 3 


By analogy with the expression for integration by parts, the derivative d'(x) 
of the 6-function is defined by the relation *: 


| dx 0 (x) f(x) = — | dx d(x) f(x) = — f'(0) (48) 

From this definition, we get immediately : 

ô'(— x) = — 6"(x) (49) 
and: 

xd'(x) = — d(x) (50) 


Conversely it can be shown that the general solution of the equation: 
xu(x) = 0(x) (51) 


can be written: 
u(x) = — 9 (x) + có(x) (52) 


where the second term arises from the homogeneous equation [cf. formulas (25) 
and (26)]. 
Equation (34) allows us to write ó'(x) in the form: 


1 +% ; i i + 00 oe 
6'(x) = al a(2) ejpx/h = x | kdke k (53) 


The nth-order derivative ô™(x) can be defined in the same way: 


| dx 5x) f(x) = (— 1" f0) (54) 
Relations (49) and (50) can then be generalized to the forms: 

ô™(— x) = (— If 6™(x) (55) 
and: 

x(x) = — n9 (x) (56) 


4. The ô-” function” in three-dimensional space 


The ó-“function” in three-dimensional space, which we shall write simply 
as ó(r), is defined by an expression analogous to (4): 


| dr òlr) f(r) = (0) (57) 


* 5'(x) can be considered to be the limit, for e —> 0, of the derivative of one of the functions 
given in $ [-b. 
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and, more generally: 
| dr ile — ro) fle) = fro) (58) 


ó(r — rọ) can be broken down into a product of three one-dimensional 
functions: 


lr — ro) = (x — xo) ölyv — vo) O(Z — Zo) (59) 


or, if we use polar coordinates: 


g 1 A g 
òlr — ro) = eae òlr — ro) (A — 00) HP — Go) 
= + ó(r = ro)ó(cos 0 — cos 0) (o — po) (60) 
: 


The properties stated above for 6(x) can therefore easily be generalized to ò(r). 
We shall mention. in addition, the important relation: 


a(+) = — 4r Ar) (61) 


where A is the Laplacian operator. 


Equation (61) can easily be understood if it is recalled that in electrostatics, an electrical 
point charge q placed at the origin can be described by a volume density p(r) equal to: 


p(t) = gó(r) (62) 


We know that the expression for the electrostatic potential produced by this charge is: 


ma 63 
r) = = 
Áme, Y (63) 
Equation (61) is thus simply the Poisson equation for this special case: 
I 
AU(r) = — — plr) (64) 
"0 


To prove (61) rigorously, it is necessary to use mathematical distribution theory. We shall 
confine ourselves here to an elementary “proof”. 

First of all, note that the Laplacian of l/r is everywhere zero, except, perhaps, at the 
origin, which is a singular point: 


d? 2d\1 
— + 0 forr 4 0 (65) 
dr? rdr/ f 


Let g,(r) be a function equal to 1/r when r is outside the sphere S,, centered at O and 
of a radius e, and which takes on values (of the order of 1/s) inside this sphere such that g,(r) 
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is sufficiently regular (continuous, differentiable, etc.). Let f(r) be an arbitrary function 
of r which is also regular at all points in space. We now find the limit of the integral: 


He) = [er f(r) Ag,(r) (66) 


for e —> 0. According to (65), this integral can receive contributions only from inside the 
sphere S,, and: 


Ke) = | dr f(r) Ag,(r) (67) 


r<e 


We choose ¢ small enough for the variation of f(r) inside S, to be negligible. Then: 


le) = rof dr Ag,(r) (68) 


Transforming the integral so obtained into an integral over the surface -4, of S,. we obtain: 


Ie) = (0) | Vg,(r).dn (69) 
$, 


Se 
Now, since g,(r) is continuous on the surface “, . we get: 


w| -| eie (70) 


Ile) = f(0) x 4ne? x ES 
& 
~ — 4nf(0) (1) 
that is: 
Pa | Pr Agr) f(r) = — 4af (0) (7e) 


According to definition (57), this is simply (61). 


Equation (61) can be used, for example, to derive an expression which is 
useful in collision theory (cf. chap. VIII): 


= — 4r d(r) (73) 


To do so, it is sufficient to consider e* *"/r as a product: 


Call (ne Gee ene a 6 on 
A = — Afe**) + et 4(—) + 20(—) . Wet) (74) 
r r F r 
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Now: 
E y, 
V(e**) = + iket" — 
r 


i : 2ik ,, 
A(e**) o k2 et ike ae (75) 


We therefore find, finally : 


„tikr k2 Pik 2 
(A + ey a | 2 + l An òlr) Lu 2 x (+ ik) + £] e tikr 
r r r2 r? ie 


— 4n c*™ Ar) 
= — 4n d(r) (76) 


according to (27). 


Equation (61) can, furthermore, be generalized : the Laplacian of the function 
¥"(0. p)r*** involves /th-order derivatives of ó(r). Consider, for example cos 0/»?. 


We know that the expression for the electrostatic potential created at a distant point 


s 0) . 
aoe .If g is the absolute 


ty an electric dipole of moment D directed along Oz ts 


> 


Anteo F° 
value of each of the two charges which make up the dipole and a is the distance 
between them. the modulus D of the dipole moment is the product qa, and the 
corresponding charge density can be written: 


plr) = as(e — 5e.) — aò(r +50) (77) 


(where e, denotes the unit vector of the Oz axis). If we let a approach zero, while 
maintaining D = qa finite, this charge density becomes: 


ole) —> DE òl) (78) 


Therefore in the limit where a —= 0, the Poisson equation, (64), yields: 


cos 0 Cc. 
al > ) = — an o(r) (79) 


r 


Of course, this formula could be justified as (61) was above, or proven by distribution 
theory. Analogous reasoning could be applied to the function Y"(6, y)/r'*! which 
gives the potential created by an electric multipole moment 27 located at the origin 
(complement E,). 


References and suggestions for further reading: 


See Dirac (1.13) $15, and, for example, Butkov (10.8), chap. 6, or Bass (10.1), 
vol. 1, §§ 21.7 and 21.8; section 10 of the bibliography, especially the subsection “Fourier 
transforms; distributions”. 
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Appendix IlI 


LAGRANGIAN AND HAMILTONIAN 
IN CLASSICAL MECHANICS 


1. Review of Newton’s laws 


a. Dynamics of a point particle 
b. Systems of point particles 
c. Fundamental theorems 


2. The Lagrangian and Lagrange’s equations 
3. The Hamiltonian and the canonical equations 


a. The conjugate momenta of the coordinates 
b. The Hamilton-Jacobi canonical equations 


4. Applications of the Hamiltonian formalism 


a. A particle in a central potential 

b. A charged particle in an electromagnetic field 
a. Description of the electromagnetic field. Gauges 
B. Equations of motion and the Lagrangian 
y. Momentum. The classical Hamiltonian 


5. The principle of least action 


a. Geometrical representation of the motion of a system 
b. Statement of the principle of least action 
c. Lagrange's equations as a consequence of the principle of least action 


We shall review the definition and principal properties of the Lagrangian 
and the Hamiltonian in classical mechanics. This appendix is not meant to be 
a course in analytical mechanics. Its goal is simply to indicate the classical basis 
for applying the quantization rules (cf. chap. III) to a physical system. In particular, 
we shall concern ourselves essentially with systems of point particles. 


1. Review of Newton's laws 


a. DYNAMICS OF A POINT PARTICLE 


Non-relativistic classical mechanics is based on the hypothesis that there 
exists at least one geometrical frame, called the Galilean or inertial frame, in which 
the following law is valid: 

The fundamental law of dynamics : a point particle has, at all times, an accele- 
ration Y which is proportional to the resultant F of the forces acting on it: 


F = my (1) 


The constant m is an intrinsic property of the particle, called its inertial mass. 
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It can easily be shown that if a Galilean frame exists, all frames which are 
in uniform translational motion with respect to it are also Galilean frames. This 
leads us to the Galilean relativity principle : there is no absolute frame; there is no 
experiment which can give one inertial frame a privileged role with respect to all 
others. 


b. SYSTEMS OF POINT PARTICLES 


If we are dealing with a system composed of n point particles, we apply the 
fundamental law to each of them* : 


mY =F; Ln (2) 


t 


The forces which act on the particles can be classed in two categories : internal 
Jorces represent the interactions between the particles of the system, and external 
Jorces originate outside the system. The internal forces are postulated to satisfy 
the principle of action and reaction: the force exerted by particle (i) on particle (j) 
is equal and opposite to the one exerted by (j) on (i). This principle is true for 
gravitational forces (Newton’s law) and electrostatic forces, but not for magnetic 
forces (whose origin is relativistic). 

If all the forces can be derived from a potential, the equations of motion, (2). 
can be written: 


mi, = — VV (3) 


where V, denotes the gradient with respect to the r; coordinates, and the potential 
energy V is of the form: 


V => Vr) + E Ve, — r) (4) 


I i<j 


(the first term in this expression corresponds to the external forces and the second 
one, to the internal forces). In cartesian coordinates, the motion of the system is 
therefore described by the 3n differential equations: 


m¡X,= — T] 
CX; 
A 
m; Y, = — > i = 1,2, n (5) 
T eV 
m, 3; = az) 


* In mechanics, a simplified notation is generally used for the time-derivatives; by definition, 
du . du 


ù = — Ü = — + ele... 
di di” 
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c. FUNDAMENTAL THEOREMS 


We shall first review a few definitions. The center of mass or center of gravity 
of a system is the point G whose coordinates are: 


n 
Y mr; 
-1 


pa (6) 


r 


Ms 


The total kinetic energy of the system is equal to: 


n 1 
: : z 
T= Y mt 


ini” 


(7) 


¿ is the velocity of particle (i). The angular momentum with respect to the 
origin is the vector: 


where r; 


ge y r; X M È; (8) 
i l 


The following theorems can then be easily proven: 

(i) The center of mass of a system moves like a point particle with a mass 
equal to the total mass of the system, subject to a force equal to the resultant of all 
the forces involved in the system: 


n 


no un (9) 


i l 


Yom, 


a t 


(ii) The time-derivative of the angular momentum evaluated at a fixed point 
is equal to the moment of the forces with respect to this point: 


n 
L=} rF, (10) 
i t 
(iii) The variation of the kinetic energy between time f, and f, is equal to 
the work performed by all the forces during the motion between these two times: 


T(t,) - T(t,) -f y Kor. dí (11) 
moil 


If the internal forces satisfy the principle of action and reaction, and if they 
are directed along the straight lines joining the interacting particles, their contribution 
to the resultant [equation (9)] and to the moment with respect to the origin 
[equation (10)] is zero. H. in addition, the system is isolated (that is. if it is not 
subject to any external forces). the total angular momentum Ñ is constant, and the 
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center of mass is in uniform rectilinear motion. This means that the total mechanical 
momentum : 


BD m; È; (12) 
is also a constant of the motion. 


2: The Lagrangian and Lagrange's equations 


Consider a system of» particles in which the forces are derived from a potential 
energy [cf. formula (4)]. which we shall write simply I(r;). The Lagrangian. or 
Lagrange’s function. of this system is the function of 67 variables 


| TE ee we A ! 
Xp Ve a Aa Vp E ly Zea | 


given by: 
4 (r;, E) = T aad V 
l 


=5 y m, E? — V(r) (13) 
LiT 


It can immediately be shown that the equations of motion written in (5) are identical 
to Lugrange's equations: 


dos e 
A a Ei) 
dr cx, Ox; 
aa) A 
d aed L -0 (14) 
dt cy; CV; 
dez 04 
ie 


A very interesting feature of Lagrange’s equations is that they always have 
the same form, independent of the type of coordinates used (whether they are 
cartesian or not). In addition, they can be applied to systems which are more 
general than particle systems. Many physical systems (including for example one or 
several solid bodies) can be described at a given time ¢ by a set of N independent 
parameters q, (= 1. 2, .... N) called generalized coordinates. Knowledge of 
the y, permits the calculation of the position In space of any point of the system. 
The motion of this system is therefore characterized by specifying the N functions 
of time q;(1). The time-derivatives g,(7) are called the generalized velocities. The 
state of the system at a given instant /, is therefore defined by the set of q (2) and 
Gilto). If the forces acting on the system can be derived from a potential energy 
Vidis das- Gy). the Lagrangian 7 (qi Go dẹ: di Gos << Gy) 18 again the 
difference between the total kinetic energy T and the potential energy PF. h can be 
shown that, for any choice of the coordinates q; the equations of motion can 
always be written: 
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d : ae 
where T denotes the total time-derivative 


O N A 


+> aw (16) 


d C 
> op 


N 
dTa tA aa 


Furthermore, it is not really necessary for the forces to be derived from a potential 
for us to be able to define a Lagrangian and use Lagrange’s equations (we shall see 
an example of this situation in $ 4-b). In the general case, the Lagrangian is a function 
of the coordinates q, and the velocities g;. and can also be explicitly time- 
dependent*. We shall then write it: 


X (4,40) (17) 


Lagrange’s equations are important in classical mechanics for several reasons. 
For one thing, as we have just indicated, they always have the same form, independent 
of the coordinates which are used. Furthermore. they are more convenient than 
Newton's equations when the system is complex. Finally, they are of considerable 
theoretical interest, since they form the foundation of the Hamiltonian formalism 
(cf. $ 3 below), and since they can be derived from a variational principle ($ 5). The 
first two points are secondary as far as quantum mechanics is concerned, since 
quantum mechanics treats particle systems almost exclusively and since the quanti- 
zation rules are stated in cartesian coordinates (cf. chap. III, $ B-5). However. the 
last point is an essential one, since the Hamiltonian formalism constitutes the point 
of departure for the quantization of physical systems. 


3. The classical Hamiltonian and the canonical equations 


For a physical system described by N generalized coordinates, Lagrange’s 
equations, (15), constitute a system of N coupled second-order differential equations 
with N unknown functions, the q;(+). We shall see that this system can be replaced 
by a system of 2N first-order equations with 2N unknown functions. 

a. THE CONJUGATE MOMENTA OF THE COORDINATES 


The conjugate momentum p, of the generalized coordinate q; is defined as : 


Pi Sar ! (18) 


p; İs also called the generalized momentum. In the case of a particle system for which 
the forces are derived from a potential energy, the conjugate momenta of the position 
variables r;(x; VY; 2;) are simply [see (13)] the mechanical momenta: 

Pp: = m;t; (19) 
However, we shall see in $ 4-b-y that this is no longer true in the presence of a magnetic 
field. 


* The Lagrangian is not unique : two functions 4 (gi di: 1) and 4 (q, di: £) may lead. using (15). 
to the same equations of motion. This is true. in particular, if the difference between 4 and 4’ is the total 
derivative with respect to time of a function Has: t): 


d 
41 4 =e a) = ot Las 
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Instead of defining the state of the system at a given time f¢ by the N coordi- 


nates qg,(t) and the N velocities q,(t), we shall henceforth characterize it by the 
2N variables: 


alo) pilt) i =1,2,..., N} (20) 


This amounts to assuming that from the 2M parameters q;(t) and p;(t), we can 
determine the q;(£) uniquely. 


b. THE HAMILTON-JACOBI CANONICAL EQUATIONS 


The classical Hamiltonian, or Hamilton's function, of the system is, by defi- 
nition : 
N 
X = Y pd L (21) 


In accordance with convention (20), we eliminate the q, and consider the Hamiltonian 
to be a function of the coordinates and their conjugate momenta. Like Y, # may 
be explicitly time-dependent: 


H (qi Pit) (22) 


The total differential of the function 4 : 


CH CH OM 
H = ES dp. Z 
BE de agp ttt I py BE (23) 
is equal to, using definitions (21) and (18): 
; r CL OF ., OF 
d¥ = Y [pi dá, + 4; dp; ] Lg Hi — Lag t4: a dt 
¿ OL CZ 
= 2 di dp; — a 14: == dt (24) 


Setting (23) and (24) equal, we see that the change from the { q,, q; } variables 
to the { q; p; } variables leads to: 


OH EX 
A AS (25-a) 
“4; qi 
CH _. 25-b 
CP; ~~ eu 
CH Cr 
ot ©) at eo 


Furthermore, using (18) and (25-a), we can write Lagrange’s equations (15) in the 
form: 


d _ A (26) 
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By grouping terms in (25-b) and (26), we obtain the equations of motion: 


dg, 0 

| dt CD; (27) 
| dp, OH ; 

' dt Og; | 


1 


which are called the Hamilton-Jacobi canonical equations. As we said, (27) 1s a system 
of 2N first-order differential equations for 2N unknown functions, the g,(t) and p,(t). 

For an n-particle system whose potential energy is V(r;), we have, according 
to (13): 


= y Pi- F; — 5Y mt + V(r;) (28) 


To express the Hamiltonian in terms of the variables r; and p,, we use (19). This 
yields: 


Hep) = Y y, + Vie) (29) 


Note that the Hamiltonian is thus equal to the total energy of the system. The 
canonical equations : 


Oo Bi 

Da m; (30) 
Py 

dr ViV 


are equivalent to Newton’s equations, (3). 


4. Applications of the Hamiltonian formalism 


a. A PARTICLE IN A CENTRAL POTENTIAL 


Consider a system composed of a single particle of mass m whose potential 
energy V(r) depends only on its distance from the origin. In polar coordinates (r, 0, y), 
the components of the particle’s velocity on the local axes (fig. 1) are: 


t =P 
bt ="r0 (31) 
to = r sin) o 
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so that the Lagrangian, (13), can be written: 


Hr, 0, p; F, 0, Oo) = 5 m [F? + r26? + r? sin?0 gp] — Vir) (32) 


/ FIGURE | 
/ The unit vectors e,, e,, €, of the local axes 
y associated with point M, where M is 
Y defined by its spherical coordinates r, 0, o. 


The conjugate momenta of the three variables r, 0, p can then be calculated: 
CL 
cr 
mA 
Pa = Bo: 
Py = = mr? sin?0 @ (33-c) 


P, = = mp (33-a) 


mr?0 (33-b) 


To obtain the Hamiltonian of the particle, we use definition (21). This amounts to 
adding V (r) to the kinetic energy, expressed in terms of r, 9, p and p,, py. Po: We find: 


2 2 
Py l 2 P 
H (r, 0, O: Pr. Po. = + +2 | + V(r 34 
( P,P Po Po) 2 mp? (x sin2 5) ( ) ( ) 


The system of canonical equations [formulas (27)] can be written here: 


dr CH  p, 

dt = Cp, = m (35-a) 
d) A 

A CA, = Le (35-b) 
di ¢p, mr 

d CH 

bcs ee A e (35-c) 


dt Cp, m sin? 0 
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2 
dp, _ _0% _ (a +e) _ (35-d) 
dt or mr? sin? 0 er 
2 
dpo _ _ OH _ Pe cos O (35-e) 
dt 00 mr? sin? 0 
dp OH 
SEO BEE 35-f 
dt 0p ot 


The first three of these equations simply give (33); the last three are the real equations 
of motion. 

Now, consider the angular momentum of the particle with respect to the 
origin: 


L=rxmy (36) 


Its local components can easily be calculated from (31): 


FL, =0 
L = — = — 2 gj = _ Po 
A mrv, mr? sin @ © Sin 0 
L, = Mv = mr?0 = py (37) 
so that: 
p? 


sin? 0 


From the angular momentum theorem [formula (10)], we know that .2 is a vector 
which is constant over time, since the force derived from the potential V(r) is central, 
that is, collinear at each instant with the vector r*. 

By comparing (34) and (38), we see that the Hamiltonian # depends on the 
angular variables and their conjugate momenta only through the intermediary of. £? : 


2 


2 
Hr, 9, P: Pe PoPa) == + L7(9, po Py) + Vir) (39) 
2m 2mr 


Now, assume that the initial angular momentum of the particle is .£¿. Since the 
angular momentum remains constant, the Hamiltonian (39) and the equation of 
motion (35-d) are the same as they would be for a particle of mass m, in a one- 
dimensional problem, placed in the effective potential: 


y Lo 


2mr? 


(40) 


* This conclusion can also be derived from (35-e) and (35-f) by calculating the time-derivatives 
of the components of 2 on the fixed axes Ox, Oy, Oz, 
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b. A CHARGED PARTICLE PLACED IN AN ELECTROMAGNETIC FIELD 

Now, consider a particle of mass m and charge q placed in an electromagnetic 
field characterized by the electric field vector E(r, t) and the magnetic field vector 
Bír, 1). 
a. Description of the electromagnetic field. Gauges 


E(r, t) and B(r, £) satisfy Maxwell's equations: 


V.E =£ (41-a) 
Eo 
ôB 
VxE=- =r (41-b) 
V.B =0 (41-c) 
ôE 


where p(r, t) and j(r, t) are the volume charge density and the current density 
producing the electromagnetic field. The fields E and B can be described by 
a scalar potential U(r, 1) and a vector potential A(r, t), since equation (41-c) implies 
that there exists a vector field A(r, t) such that: 


B = V x A(r, t) (42) 
(41-b) can thus be written: 
CA 
vxlE l= (43) 
Consequently, there exists a scalar function U(r, t) such that: 
E A — VU(r, t) (44) 


The set of the two potentials A(r, ¢) and U(r, t) constitutes what is called a gauge 
for describing the electromagnetic field. The electric and magnetic fields can be 
calculated from the { A, U } gauge by: 


Bir, t) = V x A(r, t) (45-a) 
Elr, t) = — VU(E, 1) — 5 Alr, t) (45-b) 


A given electromagnetic field, that is, a pair of fields E(r, 1) and B(r, £), can be 
described by an infinite number of gauges, which, for this reason, are said to be 
equivalent. If we know one gauge, { A, U }, which yields the fields E and B, all 
the equivalent gauges, { A’, U’ }, can be found from the gauge transformation 
formulas: 


A‘(r, t) = A(r, t) + Vy(r, t) (46-a) 
0 
U'(r, t) = U(r, t) — a x(r, t) (46-b) 
where x(r, £) is any scalar function. 
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First of all, it is easy to show from (46) that: 
Vx A'(r, t) = Vx Alr, t) 


A A 
— VU'(r, t) — < A(t, t) = — VU(r, t) — = Alr, 1) (47) 


Any gauge, { A’, U’ }, which satisfies (46) therefore yields the same electric and magnetic fields 
as { A, U}. 

Conversely we shall show that if two gauges, { A, U } and { A’, U’ }, are equivalent, 
there must exist a function y(r, £) which establishes relations (46) between them. Since, by 
hypothesis: 


Br, t) = Vx A(r, 1) = Vx A'(r, t) (48) 
we have: 
Vx (A’ — A) =0 (49) 


This implies that A’ — A is the gradient of a scalar function: 
A’—A=Vui(r, f) (50) 


z(r, 1) is, for the moment, determined only to within an arbitrary function of £, f (t). Furthermore, 
the fact that the two gauges are equivalent means that: 


ê 0 
E(r, t) = — VU(r, t) — 7 Ale t) = — VU'(r, t) — a A t) (51) 
that is: 
0 
V(U’ — U) e aed (52) 


According to (50), we must have: 


V(U' — U) = — vÊ, t) (53) 


Consequently, the functions U’ — U and — i x(r, £) can differ only by a function of t; thus, 
C 


we can choose f (t) so as to make them equal: 


0 
U' -U= -z r,t) (54) 


This completes the determination of the function y(r, 1) (to within an additive constant). Two 
equivalent gauges must therefore satisfy relations of the form (46). 


B. Equations of motion and the Lagrangian 
In the electromagnetic field, the charged particle is subject to the Lorentz force: 
F =q[E + v x B] (55) 
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(where v is the velocity of the particle at the time rt). Newton’s law therefore gives the 
equations of motion in the form: 


m¥ = q [Ef(r, £) + t x Bír, £)] (56) 


Projecting this equation onto Ox and using (45), we obtain: 


mX = q[E, + yB, — 2B,] 
dU CA . [OA OA 0A JA 
E O A O ls A oe e OS 
l x ðt E =) des =) | 67) 
It can easily be shown that these equations can be derived from the Lagrangian 
by using (15): 


are, 


Z Ar, r,t) = smi + qf. A(r, t) — qU(r, t) (58) 


Therefore, although the Lorentz force is not derived from a potential energy, we can 
find a Lagragian for the problem. 


Let us show that Lagrange’s equations (15) do yield the equations of motion (56), using 
the Lagrangian (58). To do so, we shail first calculate : 


Cr p 
~r = mx + gA,(r, t) 
Cx 
Cr ĉ ĉ 
= q t. — Afr, t) — g— U(r, t 
m Ly ea alr (59) 


Lagrange’s equation for the x-coordinate can therefore be written: 


~ 


d 0 0 
— [mx + qA,(r, )] — q t. —Alr, t) + q — Ulr, t) = 
q Link +94 dr, 0] — oF. Alr,1) +45 Ur) = 0 (60) 


Writing this equation explicitly and using (16), we again get (57): 


CA, 4 GAs A, OAS CA, | ¿CA | CA, 4 ou 0 
x—— Zz — q| x — —— + z — — = 
y dy 4 Cx y ex ex 4 ĉx 
(61) 


mX +d 


that is: 
oU @A CA CA CA OA 
X= — — — Sky) oe e PMA Enf. 62 
A d| ĉx ôt i( Ox Cy ) ¿ Oz ex ) 2) 
Y. Momentum. The classical Hamiltonian 


The Lagrangian (58) enables us to calculate the conjugate momenta of the 
cartesian coordinates x, y, z Of the particle. For example: 
OF 


Py = — = mx + QA,(r, t) (63) 
OX 
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The momentum of the particle, which is, by definition, the vector whose components 
are (DP. Py P-). is no longer equal. as in (19), to the mechanical momentum mt: 


p= mt + qA(r. t) (64) 
Finally. we shall write the classical Hamiltonian: 


Ar p:t)= p.t- Y 


= l y 2 q 
=p. (P — dA) — =~ (p — yA) „mP 4A).A +qU (65) 
that is: 
l 
H (r, p: t) = or [p — qA(r, 19]? + qUír. 1) (66) 
COMMENT: 


Hamiltonian formalism therefore uses the potentials A and U, and not 
the fields E and B directly. The result is that the description of the particle 
depends on the gauge chosen. It is reasonable to expect, however. since the 
Lorentz force is expressed in terms of the fields, that predictions concerning 
the physical behavior of the particle must be the same for two equivalent gauges. 
The physical consequences of the Hamiltonian formalism are said to be 
gauge-invariant. The concept of gauge invariance is analyzed in detail in 
complement H,,,. 


5. The principle of least action 


Classical mechanics can be based on a variational principle. the principle of least action. 
In addition to its theoretical importance. the concept of action serves as the foundation of the 
Lagrangian formulation of quantum mechanics (cf. complement Jiu). This is why we shall now 
briefly discuss the principle of least action and show how it leads to Lagrange’s equations. 


a. GEOMETRICAL REPRESENTATION OF THE MOTION OF A SYSTEM 


First of all, consider a particle constrained to move along the Ox axis. Hs motion 
can be represented by tracing, in the (x. 1) plane, the curve defined by the law of motion which 
yields x(1). 

More generally, let us study a physical system described by N generalized coordinates q; 
(for an n-particle system in three-dimensional space. N = 3»). It is convenient to interpret the q; 
to be the coordinates of a point O in an N-dimensional Euclidean space R,. There is then 
a one-to-one correspondence between the positions of the system and the points of Ry. 
With each motion of the system is associated a motion of point O in Ry. characterized by 
the N-dimensional vector function Q(t) whose components are the q;(t). As in the simple case 
of a single particle moving in one dimension, the motion of point Q. that is, the motion of the 
system, can be represented by the graph of Q(z), which is a curve in an (NV + 1)-dimensional 
space-time (the time axis is added to the N dimensions of Ry). This curve characterizes the 
motion being studied. 
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b. THE PRINCIPLE OF LEAST ACTION 


The g,(¢) can be fixed arbitrarily; this gives point O and the system an arbitrary motion. 
But their real behavior is defined by the initial conditions and the equations of motion. Suppose 
that we know that, in the course of the real motion. Q is at Q, at time 7, and at Q, at a 
subsequent time f, (as is shown schematically by figure 2): 


Q(t) = 0 
Ql) =Q, (67) 


There is an infinite number of « priori possible motions which satisfy conditions (67). They are 
represented by all the curves*, or paths in space time. which connect the points (Q,, f,) 


and (Q3, 12) (cf. fig. 2). 


| FIGURE 2 


The path in space-time which is asso- 
ciated with a given motion of the 
Q, E oa physical system. The -axis represents 

| the time and the “y-axis”, Q (which 
; symbolizes the set of generalized coor- 
dinates q;). 


Consider such a path in space-time, F, characterized by the vector function Q(t) which 
satisfies (67). If: 


IG) das In idz Ini = ZQ. Q: 1) (68) 


is the Lagrangian of the system, the action Sy which corresponds to the path F is, by definition: 


Sp = | di 210,(0). 0,0): 1] (69) 


1 


[the function to be integrated depends only on /; it is obtained by replacing the q, and q, by the 
time-dependent coordinates of Q,(r1) and O,(£) in the Lagrangian (68)]. 

The principle of least action can then be stated in the following way : of all the paths 
in space-time connecting (Q,, 1,) with (Q,. 1,). the one which is actually followed (that is, the 
one which characterizes the real motion of the system) is the one for which the action is minimal. 
In other words. when we go from the path which is actually followed to one infinitely close 
to it, the action does not vary to first order Note the analogy with other variational principles, 
such as Fermat’s principle in optics. 


* Excluding, of course. the curves which “go backward”. that is. which would give two distinct 
positions of Q for the same time £. 
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c. LAGRANGE'S EQUATIONS AS A CONSEQUENCE OF THE PRINCIPLE 
OF LEAST ACTION 


In conclusion, we shall show how Lagrange’s equations can be deduced from the principle 
of least action. 

Suppose that the real motion of the system under study is characterized by the N functions 
of time q(t), that is by the path in space-time I connecting the points (Q,, t,) and (Q3, t,). 
Now consider an infinitely close path, T’ (fig. 3), for which the generalized coordinates are 
equal to: 


qilt) = qt) + 6q,(t) (70) 
where the 6q,(t) are infinitesimally small and satisfy conditions (67), that is: 


dq,(t,) = 6q,(t,) = 0 (71) 


FIGURE 3 


Two paths in space-time which 
pass through the points (Q,, t,) 
and (Q,, ¢,): the solid-line curve 
is the path associated with the real 
motion of the system, and the 
dashed-line curve is another, infi- 
nitely close, path. 
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The generalized velocities g;(t) corresponding to T’ can be obtained by differentiating rela- 
tions (70): 


; ; ds 
dit) = dde) +> dado (72) 
t 
Thus, their increments 69,(t) are simply: 


041) = — dat) (73) 


t2 2 g 
ez cx d 
= dt — ðq; + > —— ôq, 
| b ag, 4 2 ai, +60 (74) 
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according to (73). If we integrate the second term by parts, we obtain: 


A ¡OO dd 
òS = [z T òa: +| | 


a ti 


t2 ay A 
-Í TO (75) 


cg dt cd; 


OF dor 
qa, dt ĉd; 


since the integrated term is zero, because of conditions (71). 

If T is the path in space-time which is actually followed during the real motion of the 
system, the increment ôS of the action is zero, according to the principle of least action. 
For this to be so, it is necessary and sufficient that: 


dêg oF 


rs eet a () > p= 1,2,..,N 
dt ôdi Cg; | ee 


It is obvious that this condition is sufficient. It is also necessary, since, if there existed a time 
interval during which expression (76) were non-zero for a given value k of the index i, 
the ôq;(t) could be chosen so as to make the corresponding increment 6S different from zero. 
[eZ deL | 
(It would suffice, for example, to choose them so as to make the product 6q,| == — === 
eq, dt Cd, | 
always positive or zero). Consequently, the principle of least action is equivalent to Lagrange’s 
equations. 


References and suggestions for further reading: 


See section 6 of the bibliography, in particular Marion (6.4), Goldstein (6.6), 


Landau and Lifshitz (6.7). 
For a simple presentation of the use of variational principles in physics, 


see Feynman II (7.2), chap. 19. 
For Lagrangian formalism applied to a classical field. see Bogoliubov and 


Chirkov (2.15). chap. I. 
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N.B, References followed by (e) refer to an exercise 


Absorption 
— collision with 951 
— ofa quantum, of a photon 1293, 1319, 1332 
~ of several photons 1335, 1339 (e) 


Acceptor (electron) 1445 
Acetylene (structure) 849 
Accidental degeneracy : see degeneracy 

Action 335, 1497 
Adiabatic (turning on of a potential) 912, 931 


Addition 
— of angular momenta 


997, 1027, 1087 (e), 1091 (e) 
~ of spherical harmonics 1043 
Adjoint (operator) 117 
Allowed bands 373, 378, 1156, 1432 
Ammonia 455, 845 
Amplitude 
~ probability 253, 258 
— scattering 909, 934 
Angular momentum 
— classical 1486 
— quantum theory 641 
~ commutation relations 644, 698, 702 
~ orbital 643, 644, 660 
— intrinsic : see spin 
— “standard” representations 652, 666 
~ relation to rotations 690 
— conservation 643, 709, 999 
~ addition 997, 1027, 1087 (e), 1091 (e) 
~ coupling 999, 1072 
— of a photon 1337 
— of two identical particles 1449 (e) 


Anharmonicity : see oscillator (anharmonic) 
Annihilation (or destruction) operator 489, 498 


Antibonding (molecular state) 1179 
Anticrossing (of levels) 409, 468 
Antiresonant term 1293 
Antisymmetric state 1380, 1384 
Antisymmetrizer 1380, 1384 
Atom (s) 

~ see hydrogen, helium, hydrogen-like 

— many electron 1410, 1418 

— muonic 524, 811 

~ hadronic 812 
Autler-Townes effect 1359 (e) 
Autoionization 1345, 1419 
Azimuthal (quantum number) 783 


Bands (energy) 
— allowed or forbidden 
373, 378, 382, 1156, 1432 
— valence, conduction 1444 
Barrier (potential) 
— square 36, 72, 365 
— delta function 86 (e) 
— arbitrary shape 359 
Basis (state space) 
— discrete 97, 121 
— continuous 105, 121 
— mixed 105, 138 
— characteristic relations 107, 124, 146 
Benzene 411, 480 (e) 
Bessel 
— Bessel-Parseval relation 1462 
— spherical Bessel functions 
924, 944, 945, 946 
Black body 624 
Bloch 
— equations 451, 1324, 1326 
— functions 1164 
— theorem 632, 1165 
Bohr 
— frequency 249 
— model 45, 791 
— radius endpaper, 792 


— see magneton 


Boltzmann constant endpaper 

Bond (chemical) 412, 841, 1169, 1191 

Bonding state (molecular) 1178 

Boundary conditions 352, 375 
— see periodical 

Born approximation 919, 957, 1302 


Born-Oppenheimer approximation 512, 1156 


Born-von Karman conditions 1441 
Bose condensation 1399 
Bose-Einstein 

— statistics 1399 

— distribution law 625 
Bosons 1386 
Bra 110, 125 
Bragg reflection 374, 1168 
Brillouin 

— zones 589, 1163 

— formula 440 
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INDEX 

Canonical — absorption 953 
— equations 1488 — total elastic 906, 952 
— commutation relations 150, 222 — total 956 

Cauchy principal part 1471 CSC.O. 143, 159, 206 (e), 235 

Center of mass 784, 1484 Current (probability) 238, 280, 327, 343 (e), 824 


Central (potential or field) 
— stationary states in a 776 
— scattering by a 
— approximation 
Centrifuge potential 


921 
1410, 1418 
781, 859, 864 


Characteristic equation of an operator 134 
Chemical 

— bond 412, 841, 1169, 1191 

— potential 1437 
Circular quanta 733, 754 
Classification (periodic) of the elements 1414 
Clebsch-Gordan coefficients 1023, 1035 
Closure relation 99, 122 
Coherences (of the density matrix) 302 
Coherent superposition of states 252, 296 
Collisions 903 

— with absorption 951 


— between identical particles 
1373, 1403, 1407, 1448 (e) 
— collision parameter 926, 954 
Combinations (linear) of atomic orbitals 


(L.C.A.O.) 1151, 1173 
Commutation 

— canonical relations 150, 222 

— compatibility and commutativity 231 
Commutator 168, 171, 174 
Compatible observables 231 
Complementarity 50 


Complete (Complete Set of Commuting 
Observables : C.S.C.O.) 143, 159, 206 (e), 235 

Composition : see addition 

Compton wavelength 


798, 1216, endpaper 
Condensation (Bose) 1399 


Conduction bands 1444 
Conductivity (electric) of a solid 1443 
Conductor 1443 
Configuration (electronic) 1414, 1418 
Conjugate momentum 213, 1488 
Conjugation (hermitian) 116 
Conservation 

— of probability 237, 238 

— of energy 247 

— of angular momentum 643, 709, 999 
Conservative systems 244, 310 
Constants of the motion 247, 313 


1219, 1228, 1260 
138, 218, 263 
1297, 1344 

1464 


Contact term 
Continuous spectrum 
Continuum of final states 
Convolution product 
Correlations between two systems 

293, 307, 1084, 1136 


Coulomb integral 1175 
Coupling 
— between two states 407 
— of angular momenta 999, 1072 
— spin-orbit 1001, 1215, 1223, 1261 
Creation operator 489, 498 


Cross section 


— scattering 905, 913, 934, 952 
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— in a scattering experiment 913 
Cyclotron frequency 743, 831, endpaper 


Darwin term 1216, 1260 
De Broglie 
— relation 18 
— wavelength 18, 42 
Decay 1344 


Decomposition (principle of spectral) 15, 19, 216 
Decoupling (fine or hyperfine structure) 
1245, 1274 
Degeneracy 
— of an eigenvalue 132, 217, 259 
— systematic (or essential) and accidental 
201, 783, 798, 818 


— parity 198 
— rotation 710, 1056 
— exchange 1375, 1387 


— removal by a perturbation 1098, 1106 
Delta function (Dirac) 1468 
— use in quantum mechanics 103, 112, 278 
— delta function wells and barriers 
86 (e), 87 (e), 88 (e), 1361 (e) 
Density 
— operator and matrix 295, 437 
— ofstates 381, 1166, 1297, 1346, 1435, 1440 
— one- and two-particle density 
function 1454 (e) 
Destruction operator: see annihilation ope- 
rator 


Determinant (Slater) 1390 


Deuterium 806, 1086 (e) 
Diagonalization 
— of an operator 132 
— ofa2 x 2 matrix 420 
Diamagnetism 828, 870 (e) 
Dipole (electric) 
— approximation 1307, 1309 
— transitions and selection rules 838, 1310 
— moment 1062 
Dipole (magnetic) 
— Hamiltonian 1312 


— transitions and selection rules 
1066, 1078, 1313, 1315, 1324 


— moment 1066 
Dipole-dipole interaction 
— between two electric moments 1133 


— between two magnetic moments 
1120, 1219, 1252 


Dirac 
— notation 108 
— equation 1213, 1226 
— see delta, Fermi 
Distribution 
— Bose-Einstein 626 
— Fermi-Dirac 1437 
Donor atom 809, 1445 
Doppler effect 1365 (e) 
Duality (wave-particle) 11, 42, 50 
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Eckart : see Wigner 

Ehrenfest theorem 242, 314 
Eigen (-values and -vectors)19, 33, 132, 157, 215 
Einstein 


— Planck-Einstein relations 10 
— model 518. 626 
— temperature 632 
— Bose-Einstein distribution law 625 

— Einstein, Podolsky, Rosen paradox 
94, 1084 
— coefficients 1321 
Elastic scattering 904 


Elastically bound (electron) 1315 
Electric dipole : see dipole 
Electric quadrupole : see quadrupole 


Electrodynamics (quantum) 618, 1227, 1260 


Electromagnetic 
— fields and potentials 315, 1493 
— field quantization 605 


— interaction of an atom with an electro- 


magnetic wave 1304 
Electron 

— mass and charge endpaper 

— spin 965 

— free electron gas 1432 

— conduction electron 1444 

Elements (periodic classification of) 1414 


Elliptic coordinates 1171 
Emission 
— induced 
— spontaneous 
337, 619, 1282, 1321, 
Energy : see conservation, uncertainty 
Essential (or systematic) degeneracy 
201, 710, 784, 1056 


1293, 1319, 1332 
1345, 1355 


Ethane molecule 1202 (e) 
Ethylene 

— structure 853 

— torsion oscillation 520 
Evanescent wave 37, 71, 74, 81, 282 
Evolution 

— of the vector state 222 

— of the mean values 240 

— Operator 308 
Exchange 

— degeneracy 1376, 1389 

— term 1399, 1405 

— integral 1426 
Exciton 810 


Exclusion principle 
1389, 1396, 1414, 1433, 1436, 1445 


Fermi 
— golden rule 1299 
— energy 1398, 1432, 1437 
— level 1437, 1438, 1443 
— see: Fermi-Dirac 
Fermi-Dirac 
— Statistics 1399 
— distribution 1437 
Fermions 1386 
Ferromagnetic 1428 


INDEX 
Feynman 
— postulates 335 
— Hellmann-Feynman theorem 1192 
Final states 
— continuum of 1297, 1344 
— density of 1297, 1346 
Fine structure 
— constant 797, endpaper 
— of the spectral lines 968 
— Hamiltonian 1213, 1257, 1429 
— of Hydrogen 1219 
— of Helium 1429 
Forbidden 
— see band 
— transition 1311 
Form factor 1360 (e) 
Fourier series and transformations 1459 
Free particle 22 
— stationary states with well defined 
momentum 27, 923, 947 


— stationary states with well defined 
angular momentum 921, 928, 938 
— wave packets 28, 61, 342 (e) 
Function (wave) 19, 94, 146, 225 


Gauge 315, 1309, 1493 
Gerlach : see Stern 
Green functions 
Ground state 
Group velocity : see velocity 
Gyromagnetic ratio of the electron 
— orbital 832 
— spin 383, 443, 970 


331, 915 
47, 356, 494, 505 


Hj (molecular ion) 88 (e), 412, 415, 1169 


Hadronic atoms 812 
Hall effect 1445 
Hamiltonian 222, 244, 1488 

— ofa particle in a scalar potential 223 


— ofa particle in a vector potential 
224, 317, 322, 1496 
— ofa particle in a central potential 778, 1490 


Hamilton-Jacobi equations 213, 1489 
Hanle effect 1339 (e) 
Hard sphere scattering 960, 962 (e) 
Harmonic 

— see oscillator 

— generation of harmonics 1334, 1336 
Harmonics (spherical) 664, 678 

— expressions for / = 0, 1, 2 682 

— addition of 1043 
Heaviside step function 1475 
Heisenberg 


— uncertainty relation 
27, 45, 47, 51, 59, 231, 287 
— picture 312 
Helium 


— isotopes *He and *He 1387, 1399, 1431 


— energy levels 1418 
— He* ion 810 
— solidification 519 
Hellmann-Feynmann theorem 1192 
Hermite polynomials 501, 529, 541 
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INDEX 
Hermitian 
— Operator 120, 129, 136 
— conjugation 116 
Hole 1445 
Hybridation (orbital) 841, 1184 
Hydrogen atom 
— Bohr model 45, 791 
— quantum theory 47, 792 


— ionization energy 792. endpaper 
— stationary wave functions 802 


— ina magnetic field 827, 829 
— fine structure 1219 
— relativistic energies 1226 
— hyperfine structure 1227 
— Zeeman effect of the fine structure 

1245, 1270 

— Zeeman effect of the hyperfine structure 
1232 
— Stark effect 1279 
— Lamb shift 619, 1227 
— hydrogen maser 1232 
Hydrogen-like systems 805, 1145 


Hyperfine 
— structure: see hydrogen, muonium, 
positronium, Zeeman effect 


— Hamiltonian 1217, 1247 
Identical particles 1370 
Impact (or collision) parameter 926, 954 
Induced : see emission 
Inequality (Schwarz) 165 
Instability 337, 470, 619, 1344 
Insulator 1444 
Interaction picture 347 (e) 
Interferences 11, 50, 253, 1399 
Intrinsic angular momentum : see spin 
Invariance (Gauge) 315 


Ionization energy : see hydrogen 


Jacobi : see Hamilton 


Ket 1388 
— see state 


Kuhn : see Thomas 


109, 124, 


Lagrange equations 213, 1487, 1498 


Lagrangian 213, 1487 
— of a particle in an electromagnetic field 
317, 1495 
— formulation of quantum mechanics 333 
Lamb shift 619, 1227, 1355 
Landau levels 742 
Landé factor 1055, 1086, 1237, 1274 
Laplacian 
— of I/r 1478 
— of ¥*(8, p)r'*! 1481 


Larmor precession 
390, 404, 443, 830, 831, 1238 


Lasers 1325, 1336 
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Legendre 
— polynomials 686 
- associated functions 687 
Levels 
— energy 33, 77, 351 
— two-level systems 387, 405, 1323, 1325 
— three-level systems 1359 (e) 


Lifetime 337, 470, 619, 1282, 1345, 1353 
Linear 
— see operators 
— response 1315, 1323, 1330 
— susceptibility 1331 


Magnetic dipole : see dipole 
Magnetic field 
— charged particle in a 
— hydrogen atom ina 
827, 829, 1232, 1245, 1270 
— harmonic oscillator in a 870 (e) 
— multiple level in a 1056 
Magnetic interactions 1212, 1217, 1429 
Magnetic quantum number 783 
Magnetic resonance : see resonance 
Magneton (Bohr) 


224, 240, 316, 742 


— electronic 830, endpaper 
— nuclear 1218, endpaper 
Maser 1232, 1325, 1336 
Mass 
— center of 784, 1486 
— reduced 785 
— relativistic corrections 1214, 1259 
Master equation 1324 
Matrix, matrices 124, 126 
— adjoint 129 
— Hermitian 129 
— unitary 178 
— Pauli 417 
— diagonalization of a2 x 2 matrix 420 
— density 295, 437 


Mean square deviation: see root mean 
square deviation 


Mean value of an observable 227, 240 
Measurement 
— general postulates 215, 225 
— state after a measurement 220, 226 
— ona spin 1/2 387, 398 


— on part of a physical system 290 
Mechanical momentum 
213, 225, 317, 322, 1487, 1496 
Mendeleieff classification 1414 
Metastable state 1282 
Methane structure 855 
Minimum wave packets 287, 507, 572 
Mixing (of one state with another) 
1102, 1116, 1269, 1282 
Mixture of states (statistical) 253, 296, 300, 438 
Molecule(s) 
— vibration 
— rotation 
— vibration-rotation 
— chemical bond 412, 841, 1169, 1191 
— structure of some 845, 849, 853, 855 
Moment(s): see electric dipole, magnetic 
dipole, quadrupole, multipole 


511, 1117 
712, 766 (e) 
856 
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Momentum 213, 222, 317, 322 
~ generalized 1488 
~ see angular, mechanical 
— conjugate 213, 225, 1488 


1365 (e) 
1059, 1090 (e) 


Mossbauer effect 
Multipole moments 


Muon 524, 807, 811, 1266 
Muonic atoms 524, 811, 1146 
Muonium 807, 1262 
Neumann spherical functions 947 
Neutron mass endpaper 
Non-linear 
— response 1323 
— susceptibility 1335 
Non-resonant excitation 1315 
Norm 
— of a wave function 20, 96, 107 
~ of a state vector 111 
~ conservation 237 
Normal variables 578, 591, 607, 614 
Normal modes of vibration 575, 586, 605 
Nuclear finite mass effect 807 
Nucleus 
— volume effect 526, 812, 1141, 1248 
— finite mass effect 807 
— spin 1070 
~ multipole moments 1070 
Numbers (quantum) 
— good 248 
— see azimuthal, magnetic, principal, 
radial 
Observables 136, 139, 215 
~ measurement 215, 225 
~ quantization 222 
— whose commutator equals ih 187, 286 
— for identical particles 1381, 1393 


— C.S.C.O. (Complete Set of Commuting 


Observables) 143, 159, 206 (e), 235 
Occupation number 1391 
Operators 

~ definition 96, 114 
— general properties 166 


— eigenvalue equation, diagonalization 
132, 158, 420 


— representation 126 
~ Hermitian 120, 136 
— unitary 176 
— function of an operator 169 
— restriction of an operator 168 
— density 295, 437 
— evolution 308 
~ parity 192 
~ translation 190, 556, 761 
~ rotation 693, 983 
~ scalar 705, 709 
— vector 705, 1048, 1089 (e) 


— tensor 1089 (e), 1090 (e) 
~ creation and annihilation 489, 498 
— permutation 1378, 1382 


Oppenheimer : see Born 


INDEX 
Optical theorem 956 
Orbital 
— angular momentum 643, 644, 660 
— state space 970 
Orbitals 
— atomic 841, 1447 (e) 
— molecular 1180, 1185 
— linear combination of atomic orbitals 
1151, 1173 


97, 105, 107, 122. 138 
412, 448, 1340 


Orthonormal basis 
Oscillation between two states 


Oscillator (anharmonic) 487, 1102 
Oscillator (harmonic) 
— one-dimensional 483, 511 
— two-dimensional 727 
— three-dimensional 547, 814, 870 (e) 
— in an electric field 552 
— in thermodynamic equilibrium 620 
— coupled oscillators 575, 586 
— quasi-classical states 559 
Oscillator strength 1318 
Overlap integral 1175 


Packet : see wave packet 


Paramagnetism 828, 870 (e), 1438 
Parity 
— Operator 192 
— of the multipole operators 1065, 1066 
— of a permutation 1383 
Parseval * 
— Parseval-Plancherel relation 1465, 1475 
— Parseval-Bessel relation 27, 1462 
Part (principal) 1488 


Partial waves (method of) 
921, 929, 951, 960, 962 (e), 1450 (e) 


Path (space-time) 333, 1497 
Pauli 
— matrices 417, 973 
— theory 970 
— Hamiltonian 991 (e) 


— exclusion principle 
1389, 1396, 1414, 1433, 1436, 1445 


Penetrating orbit 1414 
Periodic 
— function 1459 
— potential 367, 1159 
— boundary conditions 1165, 1440 
Permutation 
— Operators 1378, 1382 
— transformation of observables by 
1381, 1386 
— parity of 1383 
Perturbations 
— stationary 1094 
— time-dependent 1284 
Phase 
— condition of stationary 25, 58 
— see: velocity 
Phonons 586. 601 
Photoelectric effect 1345, 1361 (e) 
Photons 10, 605, 626, 1336 


— see absorption 
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a xq KK a 


INDEX 
Picture 
— Schródinger or Heisenberg 312 
— interaction 347 (e) 
Plancherel : see Parseval 
Planck 
— Planck-Einstein relations 11, 18 
— constant 11, endpaper 
Plane wave 22, 27, 100, 923, 947 


Podolsky (Einstein-Podolsky-Rosen para- 


dox 294, 1084 
Polarizability 

— of NH, (electrical) 468 

— of acharged harmonic oscillator 555 

— of a hydrogen atom 1280 


Polarization of the Zeeman components 


838, 1275 

Polynomial method 

— harmonic oscillator 535, 815 

— hydrogen atom 794 
Polynomials 

— Hermite 502, 529, 541 

— Legendre 686 
Positron 1262 
Positronium 808, 1262 
Postulates 

— general 214 

— symmetrization 1386, 1406 
Potential 

— operator 171 

— scalar or vector 224, 315, 1493 

— central 776, 814, 21 

— centrifuge 781, 859, 864 

— with cylindrical symmetry 870 (e) 

— periodic 367, 1159 

— scattering by 901 

— Yukawa 957 

— of a magnetic dipole 1248 


— see barrier, well, step, square 
Precession (Larmor) 390, 404, 443, 830 


Principal quantum number 799 
Probability 
— of a measurement result 17, 19, 216 
— amplitude 19, 253, 258 


— current and density 
19, 238, 280, 327, 343 (e) 
— ofa transition 
429, 447, 1289, 1296, 1300, 1319. 1363 (e) 
Product 


— scalar 95, 98, 106, 109, 147, 165 
— of operators 97, 127 
— tensor 153, 199, 292, 306, 430 
Projection theorem 1236 


Projector 114, 138, 168, 217, 221, 264, 1088 (e) 


Propagator 329 
Proton 
— mass endpaper 
— spin and angular momentum 1217, 1248 
— hyperfine coupling 1249 
Pumping 1324, 1330, 1339 (e) 
Pure state 297 


Quadrupole (electric) 


— Hamiltonian 1312 
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— transitions and selection rules 1314 

— moment 1064, 1205 (e) 
Quantization 

— rules 222, 225 

— of the energy 10, 18, 38, 77, 352 


— of the measurement results 
16, 215, 226, 392, 398 


— of the angular momentum 387, 652 
— of the electromagnetic field 613 
Quasi-classical states 559, 738, 763, 771 (e) 
Quenching (metastability) 1282 


Rabi's formula 413, 428, 447, 1286, 1342 


Radial 
— equation 779 
— quantum number 783 
— integral 1258 
Raman effect 516, 723, 1339 (e) 
Rayleigh scattering 516, 726 
Recoil energy 1365 (e) 


Reduced mass 786 
Reduction of the wave packet 220, 226, 265, 278 
Reiche : see Thomas 


Relaxation 452, 1324, 1362 (e), 1363 (e) 
Relative motion 785 
Relativistic corrections 1213, 1429 
Representation 
— generalities 121 
— position and momentum representa- 
tions 144, 182 
Resonance 
— ‘quantum 411, 1139, 1177 
— magnetic 443, 1123, 1128, 1205 (e) 
— scattering 73, 936, 963 (e) 
— for a sinusoidal excitation 
1292, 1331, 1336, 1340 
— integral 1175 
Response (linear, non-linear) 1315. 1323 
Restriction of an operator 168, 1105 
Ritz theorem 1149 
Root mean square deviation 
— general definition 230 
— of two conjugate observables 286 
Rosen (Einstein-Podolsky-Rosen paradox) 
294, LOR4 
Rotation(s) 690 
— operator 693, 983 


— invariance and essential degeneracy 
714, 1056, 1421, 1429 
712, 766 (e), 856 
712, 1202 (e) 


— of molecules 
Rotator (rigid) 


Rutherford’s formula 960 
Rydberg constant endpaper 
Saturation 1335 
Scalar 
— product 95, 98, 106, 109, 147, 165 
— operator 705, 709 
Scattering 
— bya potential 901 
— by a central potential 92) 
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cross section 905, 913, 934, 952 


— stationary states 907, 932 
— amplitude 909, 934 
— integral equation 915 
— resonances 73, 936, 963 (e) 
— shadow scattering 953 
— of spin 1/2 particles 1082, 1087 (e) 
— by a hard sphere 960, 962 (e) 
~ form factor 1360 (e) 
Schottky anomaly 628 
Schródinger equation 19, 222, 236, 299 
— in the position or momentum repre- 
sentations 184, 186 
— solution for conservative systems 245 
Schródinger picture 312 
Schwarz inequality 165 
Section (cross): see cross section 
Secular approximation 1297, 1340 
Selection rules 249 


— for an even or odd operator 195 
— for multipole moments 1069, 1090 (e) 
— see dipole (electric and magnetic) 
and quadrupole (electric) 
Self-consistent potential 1412 
Semiconductor 810, 1445 
Set (Complete Set of Commuting Obser- 
vables : C.S.C.O.) 143, 159, 206 (e), 235 


Shadow scattering 953 
Shell (electronic) 799, 1414 
Singlet 1008, 1426 
Slater determinant 1390 
Solids 

~ energy bands of the electrons 

373, 378, 1156, 1432, 1442 

— hydrogen-like systems in solids 809 
Specitic heat 

— ofa solid 626, 1438 

— of a two-level system 627 

— of an electron gas 1436 


Spectral terms 
Spectroscopic notation 


1418, 1424, 1447 (e) 
800, 1086, 1225, 1424 


Spectrum 
— of an observable 132, 216 
— continuous 138, 218, 263 
Sphere (hard) scattering 960, 962 (e) 
Spherical 


— harmonics : see harmonics 
— free spherical waves 921, 924, 940, 947 
~ Bessel functions 924, 944, 945, 947 
— Neumann functions 947 
Spin 965 
— quantum mechanical description 
388, 970. 974 
— fictitious 424, 1325 
— system of two spin 1/2' s 
430, 1003, 1013, 1018, 1087 (e) 
~ spin-orbit coupling 1001, 1215, 1223, 1261 
— spin-statistics theorem 1387 
Spinor (two-component) 975, 988 
Spontaneous emission 
338, 619, 1282, 1321, 1345, 1355 
Spreading of a wave packet 64. 342 (e) 
Square potentials 34, 67, 79, 269, 280 


INDEX 
Standard 
— representation of the angular momen- 
tum 652, 666 
— components of a vector operator 1089 (e) 
Stark effect 1279 
State 
— vector 108, 214, 235 
— time evolution 222, 308 
— stationary 32, 67, 246, 351 
— eigenstate 17, 132, 216, 231 
— after a measurement 220 
— see density, final 
Stationary phase condition 25, 58 
Stationary states 32, 67, 245 
— of a particle in an arbitrary potential 
well 
— ina periodic potential 367 
— in a central potential 776 
Statistics 
— spin-statistics theorem 1387 
— quantum statistics 1398 
Step (potential) 36, 69, 79, 281 
Step function 1475 
Stern-Gerlach experiment 388 
Stokes (Raman line) 517, 725 


Structure : see fine and hyperfine 


Superposition 
— principle 14, 215, 236, 252 
— coherent superposition 253, 296, 302- 
Susceptibility 
— electric 468, 555, 1280, 1317 
— magnetic 1202 (e), 1438 
— see linear, non-linear, tensor 
Symmetric 
— ket, state 1380, 1384 
— observable 1381, 1393 
Symmetrization 
— of observables 223 
— postulate 1386, 1406 
Symmetrizer 1380, 1384 


Systematic degeneracy : see degeneracy 


Tensor 
— susceptibility 1204 (e). 1358 (e) 
— interaction 1122 
— Operator 1089 (e), 1090 (e) 
Tensor product 
— definition and properties 153 
— applications 199, 430 
— state 292, 306 


Terms (spectral) 
Thermodynamic equilibrium 


1408, 1424, 1447 (e) 


— generalities 304 

— spin 1/2 440 

— harmonic oscillator 620 

— electron gas 1436 
Thomas-Reiche-Kuhn (sum rule) 1419 
Thomas precession 1215 
Torsion oscillations 520 
Townes (Autler-Townes effect) 1359 (e) 
Trace 

— of an operator 166 

— partial 305 
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INDEX 
Transitions — see: partial 
— one or several quanta 1293. 1335, 1358 (e) — free spherical 921, 924, 940 
~= see: probability, forbidden, dipole Wavelength 
(clectric and magnetic), quadrupole — L. de Broglie 18, 42 
(electric) — Compton 798. 1216, endpaper 
Translation operator 190, 556, 762 Wave packet(s) 
Transposition 1383 — generality 21 
Triplet state 1008, 1426 — propagation 28, 63, 243, 402 
Tritium — spreading 64, 342 (e) 
Tunnel effect 37, 74, 357, 461, 523, 1157 — minimum 287, 572 
— free gaussian 61 
Uncertainty relations — ata potential step 79 
— Heisenberg 27, 45, 47, 51, 59, 231, 287 — ina potential well 274 
— time-energy 250, 1294, 1356 — ina harmonic potential $72 
Unitary operator 176, 309 — two-dimensional 54 
— three-dimensional 58 
Vacuum fluctuations 619 p — of lid 220, 226, se Fh 
Valence band 1444 Well cloctron ‘7 2. 
Value (eigen-) see cigenvalue ells (potential) 37. 74 
Value (mean) of an observable 227, 240 ee Scie 96) 
Van der Waals forces 1130 — “square spherical” , 7 (e) 
Variational method — infinite one-dimensional n, eps 
1148, 1162, 1170, 1207 (e), 1208 (e) — infinite two-dimensional ” 
Vector - pr erre 86 (e), A 
— operator 705, 1048, 1089 (e) ie Papo 
— model 1054, 1072, 1237, 1242 Wigues Boker! A 339, 450, 1356 
Vector (state): see state 7 
Velocity 1047, 1048, 1068, 1089 (e), 1236 
— phase 28, 44 
— group 30, 44, 59, 64, 589, 1168 Young's double slit experiment 11 
.— generalized 213, 1487 uba potential 957 
Vibration 
— of a molecule 510, 856 


518, 601, 628 
344 (e). 1191 
526, 812, 1141, 1248 


— of the nuclei in a crystal 
Virial theorem 
Volume effect 
Von Karman : see Born 


Wave(s) 
— wave function 19, 94, 146, 225 
— evanescent 37, 71, 74, 81, 282 
— plane 22, 27, 100, 923, 947 
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Zeeman effect 
— hydrogen atom 
with spin 835 
without spin 1245, 1270 


of the hyperfine structure 1232 
— arbitrary multiple level 969, 1056 
— three-dimensional harmonic oscillator 

870 (e) 
— muonium 1253 
— positronium 1268 


